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TUMOR NECROSIS FACTOR-a AND -0 RECEPTORS 



BACKGROUND OF THE INVENTION 



The present invention relates generally to cytokine receptors and more specifically to tumor necrosis 
5 factor receptors. 

Tumor necrosis factor-a (TNFa. also known as cachectin) and tumor necrosis factor-^ (TNF/3. also 
known as lymphotoxin) are homologous mammalian endogenous secretory proteins capable of Inducing a 
'Mde variety of effects on a large number of cell types. The great similarities in the structural and functional 
characteristics of these two cytokines have resulted in their collective description as "TNF." Complementary 

10 cDNA clones encoding TNFa (Pennica et aL, Nature 5/2:724. 1984) and TNF/8 (Gray et al.. A/art/re 
3/2:721, 19S4) have been Isolated, permitting further structural and biological characterization of TNF. 

TNF proteins initiate their biological effect on cells by binding to specific TNF receptor (TNF-R) proteins 
expressed on the plasma membrane of a TNF-responsive cell. TNFa and TNF^ were first shown to bind to 
a common receptor on the human cervical carcinoma cell line ME-180 (Aggarwal et al., Nature 

ts 3/9:685,1985). Estimates of the size of the TNF-R determined by affinity labeling studies ranged from 54 to 
175 kDa (Creasey et al. Proc. Natl, Acad, Set USA 04:3293, 1987; Stauber et al.. J. Bio!. Chem. 
263:19098. 1988; Hohmann et al., J. Biol. Chem. 264:14927, 1989). Although the relationship between 
these TNF-Rs of different molecular mass is unclear. Hohmann et al. {J. Biol, Chem. 264:14927. 1989) 
reported that at least two different cell surface receptors for TNF exist on different cells kOa, respectively. 

20 None of the above publications, however, reported tiie purification to homogeneity of cell surface TNF 
receptors. 

In addition to cell surface receptors for TNF, soluble proteins from human urine capable of binding TNF 
have also been identified (Peetre et al.. Euc J. Haematol. 4/:414. 1988; Seckinger et al.. J. Exp. Med. 
/67:1511, 1988; Seckinger et al.. J. Biol. Chem, 264:11966, 1989; UK Patent Application, Publ. No. 2 218 

25 101 A to Seckinger et al.; Engelmann et ai.. J, Biol, Chem, 264:11974, 1989). The soluble urinary TNF 
binding protein disclosed by UK 2 218 101 A has a partial N-terminal amino acid sequence of Asp-Ser-Val- 
Cys-Pro-, which corresponds to tiie partial sequence disclosed later by Engelmann et al. (1989). The 
relationship of the above soluble urinary binding proteins was further elucidated after original parent 
application (U.S. Serial No. 403.241) of the present application was filed, when Engelmann et al. reported 

30 tile identification and purification of a second distinct soluble urinary TNF binding protein having an N- 
terminal amino acid sequence of Vai-Ala-Phe-Thr-Pro- (J. Biol. Chem. 265:1531. 1990). The two urinary 
proteins disclosed by the UK 2 216 101 A and the Engelmann et al. publications were shown to be 
immunochemically related to two apparentiy distinct ceil surface proteins by the ability of antiserum against 
the binding proteins to inhibit TNF binding to certain ceils. 

35 More recently, two separate groups reported tiie molecular cloning and expression of a human 55 kOa 
TNF-R (Loetscher et al.. Cell 6/:351, 1990; Schall et al.. Cell 6/:361. 1990). The TNF-R of both groups has 
an N-lerminal amino acid sequence which corresponds to the partial amino acid sequence of the urinary 
binding protein disclosed by UK 2 218 101 A, Engelmann et al. (1989) and Englelmann et al. (1990). 

In order to elucidate the relationship of tiie multiple forms of TNF-R and soluble urinary TNF binding 

40 proteins, or to study the structural and biological characteristics of TNF-Rs and tiie role played by TNF-Rs 
in the responses of various cell populations to TNF or other cytokine stimulation, or to use TNF-Rs 
effectively in therapy, diagnosis, or assay, purified compositions of TNF-R are needed. Such compositions, 
however, are obtainable in practical yields only by cloning and expressing genes encoding the receptors 
using recombinant DNA technok)gy. Efforst to purify tiie TNF-R molecule for use in biochemical analysis or 

46 to clone and express mammalian genes encoding TNF-R, however, have been impeded by lack of a 
suitable source of receptor protein or mRNA. Prior to the present invention, no cell lines were known to 
express high levels of TNF-R constitutively and continuously, which precluded purification of receptor for 
sequencing or construction of genetic libraries for cDNA cloning. 

50 

SUMMARY OF THE INVENTION 



The present invention provides isolated TNF receptors and DNA sequences encoding mammalian 
tumor necrosis factor receptors (TNF-R). in particular, human TNF-Rs. Such DNA sequences include (a) 
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cDNA clones having a nucleotide sequence derived from the coding region of a native TNF-R gene; (b) 
DNA sequences which are capable of hybridization to the cDNA clones of (a) under moderately stringent 
conditions and which encode biologically active TNF-R molecules; or (c) DNA sequences which are 
degenerate as a result of the genetic code to the DNA sequences defined in (a) and (b) and which encode 

5 biologically active TNF-R molecules. In particular, the present invention provides DNA sequences which 
encode soluble TNF receptors. 

The present invention also provides recombinant expression vectors comprising the DNA sequences 
defined above, recombinant TNF-R molecules produced using the recombinant expression vectors, and 
processes for producing the recombinant TNF-R molecules using the expression vectors. 

10 The present invention also provides isolated or purified protein compositions comprising TNF-R. and, In 
particular, soluble forms of TNF-R. 

The present invention also provides compositions for use in therapy, diagnosis, assay of TNF-R, or in 
raising antibodies to TNF-R, comprising effective quantities of soluble native or recombinant receptor 
proteins prepared according to the foregoing processes. 

15 Because of the ability of TNF to specifically bind TNF receptors (TNF-Rs). purified TNF-R compositions 
will be useful in diagnostic assays for TNF, as well as in raising antibodies to TNF receptor for use in 
diagnosis and therapy. In addition, purified TNF receptor compositions may be used directly in therapy to 
bind or scavenge TNF, thereby providing a means for regulating the immune activities of this cytokine. 
These and other aspects of the present invention will become evident upon reference to the following 

20 detailed description. 

BRIEF DESCRIPTION OF THE DRAWINGS 

25 

Figure 1 is a schematic representation of the coding region of various cDNAs encoding human and 
murine TNF-Rs. The leader sequence is hatched and the transmembrane region is solid. 
Figure 2A-2B depict the partial cDNA sequence and derived amino acid sequence of the human TNF-R 
clone 1. Nucleotides are numbered from the beginning of the S untranslated region. Amino acids are 

30 numbered from the beginning of the signal peptide sequence. The putative signal peptide sequence is 
represented by the amino acids -22 to -1. The N-terminal leucine of the mature TNF-R protein is 
underlined at position 1. The predicted transmembrane region from amino acids 236 to 265 is also 
underiined. The C-termini of various soluble TNF-Rs are mari<ed with an anrow (t). 
Figure 3A-3C depict the cDNA sequence and derived amino acid sequence of murine TNF-R clone 1 1 . 

35 Tlie putative signal peptide sequence is represented by amino acids -22 to -1. 1. The N-tenminal valine 
of the mature TNF-R protein is underlined at position 1. The predicted transmembrane region from 
amino acids 234 to 265 is also underlined. 

DETAILED DESCRIPTION OF THE INVENTION 

40 



Definitions 

46 As used herein, the terms "TNF receptor" and "TNF-R" refer to proteins having amino acid sequences 
which are substantially similar to the native mammalian TNF receptor amino acid sequences, and which are 
biologically active, as defined below, in that they are capable of binding TNF molecules or transducing a 
biological signal initiated by a TNF molecule binding to a cell, or cross-reacting with anti-TNF-R antibodies 
raised against TNF-R from natural {i.e., nonrecombinant) sources. The mature full-length human TNF-R is a 

50 glycoprotein having a molecular weight of about 80 kilodaitons (kDa). As used throughout the specification, 
the term "mature" means a protein expressed in a form lacking a leader sequence as may be present in 
full-length transcripts of a native gene. Experiments using COS cells transfected with a cONA encoding full- 
length human TNF-R showed that TNF-R bound '2S|-tnFo with an apparent Ka of about 5 x 10^ M"\ and 
that TNF-R bound ^25|-tnF;3 with an apparent Ko of about 2 x 10^ M-\ The terms "Tr'F receptor" or "TNF- 

56 R" include, but are not limited to, analogs or subunits of native proteins having at least 20 amino acids and 
which exhibit at least some biological activity in common with TNF-R. for example, soluble TNF-R 
constructs which are devoid of a transmembrane region (and are secreted from the cell) but retain the 
ability to bind TNF. Various bioequivalent protein and amino acid analogs are described in detail below. 
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TTie nomenclature for TNF-R analogs as used herein follows the convention of naming the protein (e.g., 
TNF-R) preceded by either hu (for human) or mu (for murine) and followed by a A (to designate a deletion) 
and the number of the C-lerminaJ amino acid. For example. huTNF-RA235 refers to human TNF-R having 
Asp23s as the C-terminal amino acid (i.e., a polypeptide having the sequence of amino acids 1-235 of Figure 

s 2A). In the absence of any human or murine species designation, TNF-R refers generically to mammalian 
TNF-R. Similarly, in the absence of any specific designation for deletion mutants, the term TNF-R means all 
fomns of TNF-R, including mutants and analogs which possess TNF-R biological activity. 

"Soluble TNF-R" or "sTNF-R" as used in the context of the present Invention refer to proteins, or 
substantially equivalent analogs, having an amino acid sequence corresponding to all or part of the 

10 extracellular region of a native TNF-R. for example, huTNF-RA235. huTNF-RA185 and huTNF-RA183, or 
amino acid sequences substantially similar to the sequences of amino acids 1-163. amino acids 1-185. or 
amino acids 1-235 of Rgure 2A, and which are biologically active In that they bind to TNF Ifgand. 
Equivalent soluble TNF-Rs include polypeptides which vary from these sequences by one or more 
substitutions, deletions, or additions, and which retain the ability to bind TNF or inhibit TNF signal 

IS transduction activity via cell surface bound TNF receptor proteins, for example huTNF-RAx. wherein x is 
selected from the group consisting of any one of amino acids 163-235 of Rgure 2A, Analogous deletions 
may be made to muTNF-R. Inhibition of TNF signal transduction activity can be determined by transfecting 
cells with recombinant TNF-R DNAs to obtain recombinant receptor expression. The cells - are then 
contacted with TNF and the resulting metabolic effects examined. If an effect results which is attributable to 

20 the action of the ligand. then the recombinant receptor has signal transduction activity. Exemplary 
procedures for detennining whether a polypeptide has signal transduction activity are disclosed by idzerda 
et al.. J. Exp, Med. /7/:861 (1990): Curtis et al.. Proc, Natl. Acad. Sci, USA 35:3045 (1989); Prywes et al., 
EMBO J. 5:2179(1986) and Chou et al., J. Biol. Chem. 252:1842 (1987). Alternatively, primary cells or cell 
lines which express an endogenous TNF receptor and have a detectable biological response to TNF could 

25 also be utilized. 

The term "isolated" or "purified", as used in the context of tiiis specification to define \he purity of 
TNF-R protein or protein compositions, means that the protein or protein composition is substantially free of 
otiier proteins of natural or endogenous origin and contains less than alxut 1% by mass of protein 
contaminants residual of production processes. Such compositions, however, can contain other proteins 

30 added as stabilizers, carriers, excipients or co-tiierapeutics. TNF-R is isolated if it is detectable as a single 
protein band in a polyacrylamide gel by silver staining. 

The term "substantially similar," when used to define either amino acid or nucleic acid sequences, 
means that a particular subject sequence, for example, a mutant sequence, varies from a reference 
sequence by one or more substitutions, deletions, or additions, the net effect of which Is to retain biological 

35 activity of the TNF-R protein as may be determined, for example, in one of the TNF-R binding assays set 
fortti in Example 1 t>elow. Alternatively, nucleic acid subunits and analogs are "sut>stantlally similar" to the 
specific DNA sequences disclosed herein if: (a) the DNA sequence is derived from the coding region of a 
native mammalian TNF-R gene; (b) tfie DNA sequence is capable of hybridization to DNA sequences of (a) 
under moderately stringent conditions (50 'C. 2x SSC) and which encode biologically active TNF-R 

40 molecules; or DNA sequences which are degenerate as a result of the genetic code to the DNA sequences 
defined in (a) or (b) arxi which encode biologically active TNF-R molecules. 

"Recombinant," as used herein, means tiiat a protein is derived from recombinant (e.g.. microbial or 
mammalian) expression systems. "Microbial" refers to recombinant proteins made in bacterial or fungal 
(e.g.. yeast) expression systems. As a product, "recombinant microbial" defines a protein produced in a 

45 microbial expression system which is essentially free of native endogenous substances. Protein expressed 
in most bacterial cultures, e.g.. £ colh will be free of glycan. Protein expressed in yeast may have a 
glycosylation pattern different from that expressed in mammalian cells. 

"Biologically active," as used throughout the specification as a characteristic of TNF receptors, means 
that a particular molecule shares sufficient amino acid sequence similarity with the embodiments of the 

50 present invention disclosed herein to be capable of binding detectable quantities of TNF, transmitting a TNF 
stimulus to a cell, for example, as a component of a hybrid receptor construct or cross-reacting with anti- 
TNF-R antibodies raised against TNF-R from natural (i.e., nonrecombinant) sources. Preferably, biologically 
active TNF receptors within .the scope of the present invention are capable of binding greater than 0.1 
nmoles TNF oer nmole receptor, and most preferably, greater tiian 0.5 nmole TNF per nmole receptor in 

55 standard binding assays (see below). 

"Isolated DNA sequence" refers to a DNA polymer, in the form of a separate fragment or as a 
component of a larger DNA construct, which has been derived from DNA isolated at least once In 
substantially pure form, i.e.. free of contaminating endogenous materials and in a quantity or concentration 
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enabling identification, manipulation , and recovery of the sequence and its component nucleotide se* 
quences by standard biochemical methods, for example, using a cloning vector. Such sequences are 
preferably provided in the fonm of an open reading frame uninterrupted by internal nontranslated se- 
quences, or introns, which are typically present in eukaryotic genes. Genomic ON A containing the relevant 

5 sequences could also be used as a source of coding sequences. Sequences of non-translated DNA may be 
present 5' or 3' from the open reading frame, where the same do not interfere with manipulation or 
expression of the coding regions. 

"Nucleotide sequence" refers to a heteropolymer of deoxyribonucleotides. ONA sequences encoding 
the proteins provided by this invention can be assembled from cDNA fragments and short oligonucleotide 

10 llnlcers, or from a series of oligonucleotides, to provide a synthetic gene which is capable of being 
expressed in a recombinant transcriptional unit. 

Isolation of cDNAs Encoding TNF-R 

16 

The coding sequence of TNF-R is obtained by isolating a complementary DNA (cDNA) sequence 
encoding TNF-R from a recombinant cDNA or genomic DNA library. A cDNA library is preferably 
constructed by obtaining polyadenylated mRNA from a particular cell line which expresses a mammalian 
TNF-R, for example, the human fibroblast cell line WI-28 VA4 (ATCC CCL 95.1) and using the mRNA as a 

20 template for synthesizing double stranded cDNA. The double stranded cDNA is tiien packaged into a 
recombinant vector, which is introduced into an appropriate E, coif strain and propagated. Murine or other 
mammalian cell lines which express TNF-R may also be used. TNF-R sequences contained in the cDNA 
library can be readily identified by screening the library with an appropriate nucleic acid probe which is 
capable of hybridizing witti TNF-R cDNA. Alternatively. DNAs encoding TNF-R proteins can be assembled 

25 by ligation of syntiietic oligonucleotide subunits corresponding to ail or part of ttie sequence of Figures 2A- 
2B or 3A-3C to provide a complete coding sequence. 

The human TNF receptor cDNAs of tiie present invention were isolated by the method of direct 
expression cloning. A cDNA library was constructed by first isolating cytoplasmic mRNA from the human 
fibroblast cell line WI-26 VA4. Polyadenylated RNA was isolated and used to prepare double-stranded 

30 cDNA. Purified cDNA fragments were then ligated into pCAV/NOT vector DNA which uses regulatory 
sequences derived from pDC201 (a derivative of pMLSV, previously described by Cosman et al.. Nature 
3^2:768, 1984), SV40 and cytomegalovirus DNA, described in detail below in Example 2, pCAV/NOT has 
been deposited with the American Type Culture Collection under accession No. ATCC 68014. The 
pCAV/NOT vectors containing the WI26-VA4 cDNA fragments were transformed into £ coif strain DH5o. 

36 Transformants were plated to provide approximately 800 colonies per plate. The resulting colonies were 
harvested and each pool used to prepare plasm id DNA for transfection into COS-7 cells essentially as 
described by Cosman et al. (Nature 3/2:768. 1984) and Luthman et al. {Nuct, Acfd Res, r/:1295. 1983). 
Transformants expressing biologically active cell surface TNF receptors were identified by screening for 
their ability to bind ^^si-xNF. In this screening approach, transfected COS-7 cells were Incubated with 

40 medium containing ^^si.yis^p^ the cells washed to remove unbound labeled TNF. and the cell monolayers 
contacted with X-ray film to detect concentrations of TNF binding; as disclosed by Sims et al. Scfence 
24f:58S (1988). Transfectants detected in this manner appear as dark foci against a relatively light 
background. 

Using this approach, approximately 240,000 cDNAs were screened in pools of approximately 800 
45 cDNAs until assay of one transfectant pool indicated positive foci for TNF binding. A frozen stock of 
bacteria from tiiis positive pool was grown in culture and plated to provide individual colonies, which were 
screened until a single clone (clone 11) was identified which was capable of directing synthesis of a surface 
protein with detectable TNF binding activity. The sequence of cDNA clone 1 1 Isolated by the above mettiod 
is depicted in Rgures 3A-3C. 
50 Additional cDNA clones can be isolated from cDNA libraries of other mammalian species by cross- 
species hybridization. For use in hybridization, DNA encoding TNF-R may be covalently labeled with a 
detectable substance such as a fluorescent group, a radioactive atom or a chemiluminescent group by 
methods well known to those skilled in the art. Such probes could also be used for in vitro diagnosis of 
particular conditions. 

55 Like most mammalian genes, mammalian TNF receptors are presumably encoded by multi-exon genes. 
Alternative mRNA constiucts which can be attributed to different mRNA splicing events following transcrip- 
tion, and which share large regions of identity or similarity with the cDNAs claimed herein, are considered 
to be within the scope of the present invention. 
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Other mammaiian TNF-R cDNAs are isolated by using an appropriate human TNF-R DNA sequence as 
a probe for screening a particular mammalian cDNA library by cross-species hybridization. 



5 Proteins and Anaiogs 

The present invention provides isolated recombinant mammalian TNF-R polypeptides. Isolated TNF-R 
polypeptides of this invention are substantially free of other contaminating materials of natural or endog- 
enous origin and contain less than about 1% by mass of protein contaminants residual of production 

fo processes. The native human TNF-R molecules are recovered from cell lysates as glycoproteins having an 
apparent molecular weight by SDS-PAQE of about 80 kilodaitons (kOa). The TNF-R polypeptides of this 
invention are optionally without associated native-pattern glycosylatlon. 

Mammalian TNF-R of the present invention includes, by way of example, primate, human, marine, 
canine, feline, bovine, ovine, equine and porcine TNF-R. Mammalian TNF-Rs can be obtained by cross 

15 species hybridization, using a single stranded cDNA derived from the human TNF-R DNA sequence as a 
hybridization probe to isolate TNF-R cDNAs from mammalian cDNA libraries. 

Derivatives of TNF-R within the scope of the invention also include various structural forms of the 
primary protein which retain biological activity. Due to the presence of ionizable amino and carboxyl groups, 
for example, a TNF-R protein may be in the form of acidic or basic salts, or may be in neutraJ form. 

20 Individual amino acid residues may also be modified by oxidation or reduction. 

The primary amino acid structure may be modified by forming covalent or aggregative conjugates with 
other chemfcal moieties, such as glycosyl groups, lipids, phosphate, acetyl groups and the like, or by 
creating amino acid sequence mutants. Covalent derivatives are prepared by linking particular functional 
groups to TNF-R amino acid side chains or at the N-or C-termini. Other derivatives of TNF-R within the 

25 scope of this invention include covalent or aggregative conjugates of TNF-R or its fragments with other 
proteins or polypeptides, such as by synthesis in recombinant culture as N-terminal or C-terminal fusions. 
For example, the conjugated peptide may be a a signal (or leader) polypeptide sequence at the N-terminal 
region of the protein which co-translationally or post-translatlonally directs transfer of the protein from its 
site of synthesis to its site of function inside or outside of the cell membrane or wall (e.g., the yeast a-factor 

30 leader). TNF-R protein fusions can comprise peptides added to facilitate purification or identification of TNF- 
R (e.g.. poly-His). The amino acid sequence of TNF receptor can also be linked to the peptide Asp-Tyr-Lys- 
Asp- Asp-Asp-Asp-Lys (DYKDDDDK) (Hopp et al., Bio/Technotogy 6:1204.1988.) The latter sequence is 
highly antigenic and provides an epitope reversibly bound by a specific monoclonal antibody, enabling 
rapid assay and facile purification of expressed recombinant protein. This sequence Is also specifically 

35 cleaved by bovine mucosal enterokinase at the residue immediately following the Asp-Lys pairing. Fusion 
proteins capped with this peptide may also be resistant to intracellular degradation in £ coll, 

TNF-R derivatives may also be used as immunogens, reagents in receptor-based immunoassays, or as 
binding agents for affinity purification procedures of TNF or other binding ligands. TNF-R derivatives may 
also be obtained by cross-linking agents, such as M-maieimidobenzoyI succinimide ester and N-hydrox- 

40 ysuccinimide, at cysteine and lysine residues. TNF-R proteins may also be covalently bound through 
reactive side groups to various insoluble substrates, such as cyanogen bromide-activated, bisoxirane- 
aclivated, carbonyidiimidazole-activated or tosyl-activated agarose structures, or by adsorbing to polyolefin 
surfaces (with or without glutaraldehyde cross-linking). Once bound to a substrate, TNF-R may be used to 
selectively bind (for purposes of assay or purification) anti-TNF-R antibodies or TNF. 

45 The present invention also includes TNF-R with or without associated native-pattern glycosylatlon, TNF- 
R expressed in yeast or mammaiian expression systems, e.g., COS-7 cells, may be similar or slightly 
different in nrwlecular weight and glycosylatlon pattern than the native molecules, depending upon the 
expression system. Expression of TNF-R DNAs in bacteria such as £ coll provides non-glycosylated 
molecules. Functional mutant analogs of mammalian TNF-R having inactivated N-glycosylation sites can be 

50 produced by oligonucleotide synthesis and ligation or by site-specific mutagenesis techniques. These 
anak>g proteins can be produced in a homogeneous, reduced-carixshydrate form in good yield using yeast 
expression systems. N-glycosylation sites in eukaryotic proteins are characterized by the amino acid triplet 
Asn-Ai-2, where Ai is any amino acid except Pro, and Z is Ser or Thr. In this sequence, asparagine 
provides a side chain amino group for covalent attachment of carbohydrate. Such a site can be eliminated 

55 by substituting another amino acid for Asn or for residue Z, deleting Asn or Z. or inserting a non-Z amino 
acid between Ai and Z, or an amino acid other than Asn between Asn and Ai. 

TNF-R derivatives may also be obtained by mutations of TNF-R or its subunits. A TNF-R mutant, as 
referred to herein, is a polypeptide homologous to TNF-R but which has an amino acid sequence different 
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from native TNF-R because of a deletion, insertion or substitution. 

BioequivaJent analogs of TNF-R proteins may be constructed by, for example, making various 
substitutions of residues or sequences or deleting tenminal or internal residues or sequences not needed for 
biological activity. For example, cysteine residues can be deleted (e.g., Cys^'^) or replaced with ottier 

5 amino acids to prevent formation of unnecessary or incorrect intramolecular disutfide bridges upon 
renaturation. Ottier approaches to mutagenesis involve modification of adjacent dibasic amino acid residues 
to enhance expression in yeast systems in which KEX2 protease activity is present. Generally, substitutions 
should be made conservatively: i.e., the most preferred substitute amino acids are those having 
physiochemical characteristics resembling those of the residue to be replaced. Similarly, when a deletion or 

10 insertion strategy is adopted, the potential effect of the deletion or insertion on biological activity should be 
considered. Sub»startiaily simitar polypeptide sequences, as defined above, generally comprise a like 
number of amino acids sequences, alttiough C-terminal truncations for the purpose of constructing soluble 
TNF-Rs will contain fewer amino acid sequences. In order to preserve tfie biological activity of TNF-Rs, 
deletions and substitutions will preferably result in homologous or conservatively substituted sequences, 

15 meaning that a given residue is replaced by a biologically similar residue. Examples of conservative 
substitutions include substitution of one aliphatic residue for another, such as lie, Val, Leu, or Ala for one 
another, or substitutions of one polar residue for another, such as between Lys 3X\6 Arg; Qlu and Asp; or 
Gin and Asn. Other such conservative substitutions, for example, substitutions of entire regions having 
similar hydrophobicity characteristics, are well known. Moreover, particular amino acid differences between 

20 human, murine and otiier mammalian TNF-Rs is suggestive of additional conservative substitutions that 
may be made without altering tine essential biological characteristics of TNF-R. 

Subunits of TNF-R may be constructed by deleting temiinai or intemal residues or sequences. 
Particularly preferred sequences include those in which the transmembrane region and intracellular domain 
of TNF-R are deleted or substituted with hydrophilic residues to facilitate secretion of the receptor into the 

as cell culture medium. The resulting protein is referred to as a soluble TNF-R molecule which retains its 
ability to bind TNF. A particularly preferred soluble TNF-R consti^jct is TNF-RA235 (the sequence of amino 
adds 1-235 of Figure 2A), which comprises the entire extracellular region of TNF-R, terminating with Asp^^ 
immediately adjacent the transmembrane region. Additional amino acids may be deleted from the trans- 
membrane region while retaining TNF binding activity. For example, huTNF-RA183 which comprises the 

30 sequence of amino acids 1-183 of Figure 2A, and TNF-RA163 which comprises the sequence of amino 
adds 1-163 of Rgure 2A, retain tiie abilKy to bind TNF ligand as determined using the binding assays 
described below in Example 1. TNF-RA142. however, does not retain tine ability to bind TNF ligand. This 
suggests that one or both of Cys"" and Cys^" is required for fonnation of an intramolecular disulfide 
bridge for the proper folding of TNF-R. Cys^". which was deleted witiioul any apparent adverse effect on 

35 the ability of the soluble TNF-R to bind TNF, does not appear to be essential for proper folding of TNF-R. 
Thus, any deletion C-terminai to Cys*^^ would be expected to result in a biologically active soluble TNF-R, 
The present invention contemplates such soluble TNF-R consti-ucts corresponding to all or part of the 
extracellular region of TNF-R terminating witti any amino acid after Cys'*^. Other C-terminal deletions, such 
as TNF-FA157, may be made as a matter of cc'- ^'^i'^nco by cutting TNF-R cDNA witii appropriate 

40 restriction enzymes and, if necessary, reconstructing, ^pedfic sequences witii synthetic oligonucleotide 
linkers. The resulting soluble TNF-R constructs are then Inserted and expressed in appropriate expression 
vectors and assayed for the ability to bind TNF. as described in Example 1, Biologically active soluble TNF- 
Rs resulting from such constructions are also contemplated to be within the scope of the present Invention. 
Mutations in nucleotide sequences constructed for expression of analog TNF-R must, of course, 

45 presen/e the reading frame phase of the coding sequences and preferably will not create complementary 
regions tfiat could hybridize to produce secondary mRNA structures such as loops or hairpins which would 
adversely affect translation of tiie receptor mRNA. Although a mutation site may be predetermined, it is not 
necessary that the nature of the mutation per se be predetermined. For example, in order to select for 
optimum characteristics of mutants at a given site, random mutagenesis may be conducted at the target 

50 codon and the expressed TNF-R mutants screened for the desired activity. 

Not all mutations in tiie nucleotide sequence which encodes TNF-R will be expressed in the final 
product, for example, nucleotide substitutions may be made to enhance expression, primarily to avoid 
secondary structure loops in the ti'anscribed mRNA (see EPA 75,444A, incorporated herein by reference), 
or to provide codons that are more readily translated by the selected host e.g., the well-known E colt 

66 preference codons for £. coll expression. 

Mutations can be introduced at particular loci by synthesizing oligonucleotides containing a mutant 
sequence, flanked by restilction sites enabling ligation to fragments of ttie native sequence. Following 
ligation, the resulting reconstructed sequence encodes an analog having the desired amino acid insertion. 
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substitution, or deletion. 

Alternatively, oligonucleotide-dtrected site-specific mutagenesis procedures can be employed to provide 
an altered gene having particular codons altered according to the substitution, deletion, or insertion 
required. Exemplary methods of making the alterations set forth above are disclosed by Walder et al. (Gene 

5 42:133, 1986): Bauer et al. {Gene 37:73, 1985); Craik (BfoTechniques, January 1985, 12-19); Smith et al. 
[Genetic Engineering: Principles and Methods, Plenum Press. 1981): and U.S. Patent Nos. 4.518.584 and 
4,737.462 disclose suitable techniques, and are incorporated by reference herein. 

Both monovalent forms and polyvalent forms of TNF-R are useful in the compositions and methods of 
this invention. Polyvalent forms possess multiple TNF-R binding sites for TNF ligand. For example, a 

10 bivalent soluble TNF-R may consist of two tandem repeats of amino acids 1-236 of Rgure 2A, separated by 
a linker region. Alternate polyvalent forms may also be constructed, for example, by chemically coupling 
TNF-R to any clinically acceptable carrier molecule, a polymer selected from the group consisting of Ficoll, 
polyethylene glycol or dextran using conventional coupling techniques. Alternatively, TNF-R may be 
chemically coupled to biotin, the biotin-TNF-R conjugate then allowed to bind to avidin, resulting in 

IS tetravalent avidin/biotin/TNF-R molecules. TNF-R may also be covalently coupled to dlnitrophenol (DNP) or 
trinitrophenol (TNP) a^d the resulting coniugate precipitated with anti-DNP or anti-TNP-IgM, to form 
decameric conjugates with a valency of 10 for TNF-R binding sites. 

A recombinant chimeric antibody molecule may also be produced having TNF-R sequences substituted 
for the variable domains of either or both of the immunoglubulin molecule heavy and light chains and 

20 having unmodified constant region domains. For example, chimeric TNF-R/!gQi may be produced from two 
chimeric genes - a TNF-R/human x light chain chimera (TNF-R/C,) and a TNF-R/human 71 heavy chain 
chimera (TNF-R/C^i). Foltowing transcription and translation of the two chimeric genes, the gene products 
assemble into a single chimeric antibody molecule having TNF-R displayed bivalently. Such polyvalent 
forms of TNF-R may have enhanced binding affinity for TNF ligand. Additional details relating to the 

25 construction of such chimeric antibody molecules are disclosed in WO 89/09622 and EP 315062. 



Expression of Recombinant TNF-R 

30 The present invention provides recombinant expression vectors to amplify or express DNA encoding 
TNF-R. Recombinant expression vectors are replicable DNA constructs which have synthetic or cDNA- 
derived DNA fragments encoding mammalian TNF-R or bioequivalent analogs operably linked to suitable 
transcriptional or translational regulatory elements derived from mammalian, microbial, viral or insect genes. 
A transcriptional unit generally comprises an assembly of (1) a genetic element or elements having a 

35 regulatory role in gene expression, for example, transcriptional promoters or enhancers. (2) a structural or 
coding sequence which is transcribed into mRNA and translated into protein, and (3) appropriate transcrip- 
tion and translation initiation and termination sequences, as described in detail below. Such regulatory 
elements may include an operator sequence to control transcription, a sequence encoding suitable mRNA 
ribosomal binding sites. The ability to replicate in a host usually conferred by an origin of replication, and a 

40 selection gene to facilitate recognition of transfomnants may additionally be incorporated. DNA regions are 
operably linked when they are functionally related to each other. For example, DNA for a signal peptide 
(secretory leader) is operably linked to DNA for a polypeptide If It is expressed as a precursor which 
participates in the secretion of the polypeptide: a promoter is operably linked to a coding sequence if it 
controls the transcription of the sequence; or a ribosome binding site is operably linked to a coding 

45 sequence if it is positioned so as to permit translation. Generally, operably linked means contiguous and, in 
the case of secretory leaders, contiguous and in reading frame. Structural elements intended for use in 
yeast expression systems preferably include a leader sequence enabling extracellular secretion of trans- 
lated protein by a host cell. Alternatively, where recombinant protein is expressed without a leader or 
transport sequence, it may include an N-lenminal methionine residue. This residue may optionally be 

50 subsequently cleaved from the expressed recombinant protein to provide a final product 

DNA sequences encoding mammalian TNF receptors which are to be expressed In a microorganism 
will preferably contain no introns that could prematurely terminate transcription of DNA into mRNA; 
however, premature tenmination of transcription may be desirable, for example, where it would result in 
mutants having advantageous C-terminal truncations, for example, deletion of a transmembrane region to 

55 yield a soluble receptor not bound to the cell membrane. Due to code degeneracy, there can be 
considerable variation in nucleotide sequences encoding the same amino acid sequence. Other embodi- 
ments include sequences capable of hybridizing to the sequences of the provided cDNA under moderately 
stringent conditions (50* C, 2x SSC) and other sequences hybridizing or degenerate to those which encode 
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biologically active TNF receptor polypeptides. 

Recombinant TNF-R DNA is expressed or amplifiod in a recombinant expression system comprising a 
substantially homogeneous monoculture of suitable host microorganisms, for example, bacteria such as £ 
coll or yeast such as S, cerevtsiae, which have stably integrated (by transformation or transfection) a 
5 recombinant transcriptional unit into chromosomal DNA or carry the recombinant transcriptional unit as a 
component of a resident plasmid. Generally, cells constituting the system are tiie progeny of a single 
ancestral transformant Recombinant expression systems as defined herein will express heterologous 
protein upon induction of the regulatory elements linked to the DNA sequence or synthetic gene to be 
expressed. 

w Transformed host cells are cells which have been transformed or transfected with TNF-R vectors 
constructed using recombinant DNA techniques. Transformed host cells ordinarily express TNF-R, but host 
cells transformed for purposes of cloning or amplifying TNF-R DNA do not need to express TNF-R. 
Expressed TNF-R will be deposited in the cell membrane or secreted into the culture supernatant 
depending on the TNF-R DNA selected. Suitable host cells for expression of mammalian TNF-R include 

IS prokaryotes. yeast or higher eukaryotic cells under ttie control of appropriate promoters. Prokaryotes 
include gram negative or gram positive organisms, for example £ coti or bacilli. Higher eukaryotic cells 
include established cell lines of mammalian origin as described below. Cell-free translation systems could 
also be employed to produce mammalian TNF-R using RNAs derived from the DNA constructs of tiie 
present invention. Appropriate cloning and expression vectors for use with bacterial, fungal, yeast, and 

20 mammalian cellular hosts are described by Pouwels et al. (Cfonfng Vectors: A Laboratory ManuaL 
Elsevier. New Yortc, 1985), the relevant disclosure of which is hereby incorporated by reference, 

Prokaryotic expression hosts may be used for expression of TNF-R that do not require extensive 
proteolytic and disulfide processing. Prokaryotic expression vectors generally comprise one or more 
phenotypic selectable markers, for example a gene encoding proteins conferring antibiotic resistance or 

25 supplying an autotrophic requirement, and an origin of replication recognized by the host to ensure 
amplification within the host. Suitable prokaryotic hosts for transformation include £ co//, Baciitus subtilts, 
Saimor)ella typhlmurlum, and various species witiiin the genera Pseudomonas, Streptomyces, and 
Staphyolococcus, although others may also be employed as a matter of choice. 

Useful expression vectors for bacterial use can comprise a selectable mari<er and bacterial origin of 

30 replication derived from commercially available plasmids comprising genetic elements of the well known 
cloning vector pBR322 (ATCC 37017), Such commerdai vectors include, for example, pKK223-3 
(Phamriacia Rne Chemicals. Uppsala, Sweden) and pGEM1 (Promega Biotec. Madison. Wl, USA). These 
pBR322 "backbone" sections are combined with an appropriate promoter and the structural sequence to be 
expressed. £ coll is typically transformed using derivatives of pBR322. a plasmid derived from an £ coli 

35 species (Bolivar et al.. Gene 2:95, 1977). pBR322 contains genes for ampidllin and tetracycline resistance 
and thus provides simple means for identifying transformed cells. 

Promoters commonly used in recombinant microbial expression vectors include the ;8-lactamase 
(penicillinase) and lactose promoter system (Chang et al.. Nature 275:815, 1978: and Goedde! et al., 
Nature 281:544, 1979), the tryptophan (trp) promoter system (Goeddel et al., Nucl. Acids Res. 8:4057, 

40 1980; and EPA 36,776) and tac promoter (Maniatis, Molecular Cloning: A Lai ratory Manual, Cold Spring 
Harbor Laboratory, p. 412, 1982). A particularly useful bacterial expression system employs the phage x P|. 
promoter and cl857ts thennolabile repressor. Plasmid vectors available from the American Type Culture 
Collection which incorporate derivatives of the X Pl promoter include plasmid pHUB2, resident in £ coli 
strain JMB9 (ATCC 37092) and pPLc28. resident in £ coli RR1 (ATCC 53082). 

45 Recombinant TNF-R proteins may also be expressed in yeast hosts, preferably from tine Sac- 
charomyces species, such as S. cerevfsiae. Yeast of other genera, such as PIchIa or Kluyveromyces 
may also be emptoyed. Yeast vectors will generally contain an origin of replication from the 2u yeast 
plasmid or an autonomously replicating sequence (ARS), promoter, DNA encoding TNF-R, sequences for 
polyadenylation and transcription termination and a selection gene. Preferably, yeast vectors will Include an 

so origin of replication and selectable maricer permitting transformation of both yeast and £ colK e.g., the 
ampicillin resistance gene of £ coif and S. cerevislae TRPI or URA3 gene, which provides a selection 
mariner for a mutant strain of yeast lacking the ability to grow in tryptophan, and a promoter derived from a 
highly expressed yeast gene to Induce transcription of a staictural sequence downstream. The presence of 
the TRPI or URA3 lesion in the yeast host cell genome tiien provides an effective environment for 

56 detecting transfonmation by growth in tiie absence of tryptophan or uracil. 

Suitable promoter sequences in yeast vectors include the promoters for metallothionein, 3- 
phosphoglycerate kinase (Hitzeman et al., J, Blot. Chem. 25S:207Z, 1980) or other glycolytic enzymes 
(Hess et al.. J. Adv, Enzyme Reg. 7:149. 1968: and Holland et al.. Bfochem. / 7:4900, 1978), such as 
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enolase, glyceratdGhyde-3iDhosphat9 dehydrogenase, hexokinase. pyruvate decartx)xylase, phosphofaic- 
tokinase. glucose-6-pliosphate isomerase, 3-phosphoglycerate mutase, pyruvate kinase, triosephosphate 
isom erase, phosphoglucose isomerase. and glucoktnase. Suitable vectors and promoters for use in yeast 
expression are further described in R. Hitzeman et al.. EPA 73.657. 

5 Prefen^ed yeast vectors can be assembled using DNA sequences from pUC18 for selection and 
replication in £ coii (Amp' gene and origin of replication) and yeast DNA sequences Including a glucose- 
repressible A0H2 promoter and a-f actor secretion leader. The A0H2 promoter has been described by 
Russell et al. {J. Blof. Chem. 255:2674. 1982) and Beier et al. {Nature 300:724, 1982). The yeast a-factor 
leader, which directs secretion of heterologous proteins, can be inserted between the promoter and the 

10 structural gene to be expressed. See, e.g.. Kurjan et al.. Cetl 30:933. 1982; and Bitter et al.. Proc, NatL 
Acad, ScL USA fi/:5330, 1984. The leader sequence may be modified to contain, near its 3' end. one or 
more useful restriction sites to facilitate fusion of the leader sequence to foreign genes. 

Suitable yeast transfonnnation protocols are known to those of skill in the art; an exemplary technique is 
described by Hinnen et al.. Proc, Natl. Acad. Scl, USA 75:1929. 1978, selecting for Trp* transfonmants in a 

/5 selective medium consisting of 0.67% yeast nitrogen base. 0.5% casamino acids, 2% glucose. 10 ug/ml 
adenine and 20 ug/ml uracil or URA+ tranformants in medium consisting of 0.67% YNB, with amino acids 
and bases as described by Sherman et al., Laboratory Course Manual for Methods In Yeast Genetics, 
Cold Spring Harbor Laboratory. Cold Spring Harbor. New York, 1986. 

Host strains transformed by vectors comprising the A0H2 promoter may be grown for expression in a 

20 rich medium consisting of 1% yeast extract. 2% peptone, and 1% or 4% glucose supplemented with 80 
ug/ml adenine and 80 ug/ml uracil. Derepression of the ADH2 promoter occurs upon exhaustion of medium 
glucose. Crude yeast supernatants are harvested by filtration and held at 4* C prior to further purification. 

Various mammalian or insect ceil culture systems are also advantageously employed to express 
recombinant protein. Expression of recombinant proteins In mammalian cells Is particulariy preferred 

26 because such proteins are generally conrectly folded, appropriately modified and completely functional. 
Examples of suitable mammalian host cell lines include the COS-7 lines of monkey kidney cells, described 
by Qluzman (Cell 23:175. 1981), and other cell lines capable of expressing an appropriate vector Including, 
for example. L cells. CI 27, 3T3, Chinese hamster ovary (CHO). HeLa and BHK cell lines. Mammalian 
expression .vectors may comprise nontranscribed elements such as an origin of replication, a suitable 

30 promoter and enhancer linked to the gene to be expressed, and other 5' or 3 flanking nontranscribed 
sequences, and 5 or 3' nontranslated sequences, such as necessary ribosome binding sites, a 
polyadenylation site, splice donor and acceptor sites, and transcriptional tenmi nation sequences. Baculovirus 
systems for production of heterologous proteins in insect cells are reviewed by Luckow and Summers. 
Bio/Technology 6:47 (1988). 

35 The transcriptional and translational control sequences in expression vectors to be used in transforming 
vertebrate cells may be provided by viral sources. For example, commonly used promoters and enhancers 
are derived from Polyoma, Adenovirus 2, Simian Virus 40 (SV40), and human cytomegalovirus. DNA 
sequences derived from the SV40 viral genome, for example. SV40 origin, eariy and late promoter, 
enhancer, splice, and polyadenylation sites may be used to provide the otiier genetic elements required for 

40 expression of a heterologous DNA sequence. The early and late promoters are particularly useful because 
both are obtained easily from tiie vims as a fragment which also contains the SV40 viral origin of replication 
(Rers et al.. Nature 273:113, 1978). Smaller or larger SV40 fragments may also be used, provided the 
approximately 250 bp sequence extending from the Hind 3 site toward ttie Bgh site located in the viral 
origin of replication is included. Further, mammalian genomic TNF-R promoter, control and/or signal 

45 sequences may be utilized, provided such control sequences are compatible witf^ tine host cell chosen. 
Additional details regarding the use of a mammalian high expression vector to produce a recombinant 
mammalian TNF receptor are provided in Examples 2 and 7 below. Exemplary vectors can be constructed 
as disclosed by Okayama and Berg {Mol. Cell. Biol. 3:280. 1983). 

A useful system for stable high level expression of mammalian receptor cDNAs in C127 murine 

50 mammary epithelial cells can be constructed substantially as described by Cosman et al. [Mol. Immunol. 
23:935. 1986). 

In preferred aspects of the present invention, recombinant expression vectors comprising TNF-R cDNAs 
are stably integrated into a host ceirs DNA. Elevated levels of expression product is achieved by selecting 
for cell lines having amplified numl^ers of vector DNA. Cell tines having amplified numbers of vector DNA 
55 are selected, for example, by transforming a host cell with a vector comprising a DNA sequence which 
encodes an enzyme which is inhibited by a known drug. The vector may also comprise a DNA sequence 
which encodes a desired protein. Alternatively, the host cell may be co-transformed with a second vector 
which conriprises the DNA sequence which encodes the desired protein. The transformed or co-transformed 
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host cells are then cultured in increasing concentrations of the known drug, thereby selecting for drug- 
resistant cells. Such drug-resistant cells survive In increased concentrations of the toxic drug by over- 
production of the enzyme which is inhibited by the drug, frequently as a result of amplification of the gene 
encoding the enzyme. Where drug resistance is caused by an increase in the copy number of the vector 

5 DNA encoding the inhibitable enzyme, there is a concomitant co-amplification of the vector DNA encoding 
the desired protein (TNF-R) in the host cell's DNA. 

A preferred system for such co-amplification uses the gene for dihydrofoiate reductase (OHFR), which 
can be inhibited by the drug methotrexate {MTX). To achieve co-amplification, a host cell which lacks an 
active gene encoding OHFR is either transfomned with a vector which comprises DNA sequence encoding 

10 DHFR and a desired protein, or is co-transformed with a vector comprising a DNA sequence encoding 
DHFR and a vector comprising a DNA sequence encoding the desired protein. The transformed or co- 
transformed host cells are cultured in media containing increasing levels of MTX, and those cells lines 
which survive are selected. 

A particularly preferred co-am pliflcatlon system uses the gene for glutamine synthetase (QS), which Is 

IS responsible for the synthesis of glutamate and ammonia using the hydrolysis of ATP to ADP and phosphate 
to drive the reaction. QS is subject to inhibition by a variety of inhibitors, for example methionine 
sulphoximine (MSX). Thus, TNF-R can be expressed in high concentrations by co-amplifying cells 
transformed with a vector comprising the DNA sequence for QS and a desired protein, or co-transfomned 
with a vector comprising a DNA sequence encoding GS and a vector comprising a DNA sequence encoding 

20 the desired protein, culturing the host cells in media containing increasing levels of MSX and selecting for 
surviving cells. The GS co-am pirfication system, appropriate recombinant expression vectors and cells lines, 
are described in the following PCT applications: WO 87/04462. WO 89/01036. WO 89/10404 and WO 
88A)5807. 

Recombinant proteins are preferably expressed by co-amplification of DHFR or GS in a mammalian 
25 host cell, such as Chinese Hamster Ovary (CHO) cells, or alternatively in a murine myeloma cell line, such 
as SP2/0-Ag14 or NSO or a rat myeloma cell line, such as YB2/3.0-Ag20. disclosed in PCT applications 
WO/89/10404 and WO 86/05807. 

A preferred eukaryotic vector for expression of TNF-R DNA is disclosed below in Example 2. This 
vector, referred to as pCAV/NOT. was derived from the mammalian high expression vector pDC20l and 
30 contains regulatory sequences from SV40, adenovirus-2, and human cytomegalovirus. 



Purification of Recombinant TNF-R 

35 Purified mammalian TNF receptors or analogs are prepared by culturing suitable host/vector systems to 
express the recombinant translation products of the DNAs of the present invention, which are then purified 
from culture media or cell extracts. 

For example, supernatants from systems which secrete recombinant protein into culture media can be 
first concentrated using a commercially available protein concentration filter, for example, an Amicon or 

40 MilHpore Pellicon ultrafiltration unit. Following the concentration step, the concentrate can be applied to a 
suitable purification matrix. For example, a suitable affinity matrix can comprise a TNF or lectin or antibody 
molecule bound to a suitable support. AHematively. an anion exchange resin can be employed, for example, 
a matrix or substrate having pendant diethylaminoethyl (DEAE) groups. The matrices can be acrylamlde, 
agarose, dextran, cellutose or other types commonly employed in protein purification. Alternatively, a cation 

45 exchange step can be employed. Suitable cation exchangers include various insoluble matrices comprising 
sulfopropyl or cartwxymethyl groups. Sulfopropyl groups are preferred. 

Finally, one or more reversed-phase high performance liquid chromatography (RP-HPLC) steps employ- 
ing hydrophobic RP-HPLC media, e.g., silica gel having pendant methyl or other aliphatic groups, can be 
employed to further purify a TNF-R composition. Some or all of the foregoing purification steps, in various 

50 combinations, can also be employed to provide a homogeneous recombinant protein. 

Recombinant protein produced in bacterial culture is usually isolated by initial extraction from cell 
pellets, followed by one or more concentration, salting-out, aqueous Ion exchange or size exclusion 
chromatography steps. Rnaily, high performance liquid chromatography (HPLC) can be employed for final 
purification steps. Microbial cells employed in expression of recombinant mammdlie'^ TNF-R can be 

65 disrupted by any convenient method, including freeze-thaw cycling, sonication, mechanical disruption, or 
use of ceil lysing agents. 

Fermentation of yeast which express mammalian TNF-R as a secreted protein greatly simplifies 
purification. Secreted recombinant protein resulting from a large-scale fermentation can be purified by 
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methods analogous to those disclosed by Urdai et al. {J. Chromatog, 29&A7^, 1984). This reference 
describes two sequential, reversed-phase HPLC steps for purification of recombinant human GM-CSF on a 
preparative HPLC column. 

Human TNF-R synthesized in recombinant culture is characterized by the presence of non-human cell 

5 components, including proteins, in amounts and of a character which depend upon the purification steps 
taken to recover human TNF-R from the culture. These components ordinarily will be of yeast, prokaryotic 
or non-human higher eukaryotic origin and preferably are present in innocuous contaminant quantltieSt on 
the order of less than about 1 percent by weight. Further, recombinant cell culture enables the production of 
TNF-R free of proteins which may be normally associated with TNF-R as it is found in nature in its species 

10 of origin, e.g. in cells, cell exudates or body fluids. 



Therapeutic Administration of Recombinant Soluble TNF-R 



75 The present invention provides- methods of using therapeutic compositions comprising an effective 
amount of soluble TNF-R proteins and a suitable diluent and carrier, and methods for suppressing TNF- 
dependent inflammatory responses in humans comprising administering an effective amount of soluble 
TNF-R protein. 

For therapeutic use. purified soluble TNF-R protein is administered to a patient, preferably a human, for 

20 treatment in a manner appropriate to the Indication. Thus, for example, soluble TNF-R protein compositions 
can be administered by bolus injection, continuous infusion, sustained release from Implants, or other 
suitable technique. Typicaily, a soluble TNF-R therapeutic agent will be administered in the fomn of a 
composition comprising purified protein in conjunction with physiologically acceptable carriers, excipients or 
diluents. Such carriers will be nontoxic to recipients at the dosages and concentrations employed. 

25 Ordinarily, the preparation of such compositions entails combining the TNF-R with buffers, antioxidants such 
as ascorbic acid, low molecular weight (less than about 10 residues) polypeptides, proteins, amino adds, 
carbohydrates including glucose, sucrose or dextrins, chelating agents such as EDTA, glutathione and other 
stabilizers and excipients. Neutral buffered saline or saline mixed with conspecific serum albumin are 
exemplary appropriate diluents. Preferably, product is formulated as a lyophilizate using appropriate 

30 exdpient solutions (e.g., sucrose) as diluents. Appropriate dosages can be deteonined In trials. The amount 
and frequency of administration will depend, of course, on such factors as the nature and severity of the 
indication being treated, the desired response, the condition of tiie patient, and so forth. 

Soluble TNF-R proteins are administered for the purpose of Inhibiting TNF-dependent responses. A 
variety of diseases or conditions are believed to be caused by TNF, such as cachexia and septic shock. In 

35 addition, other key cytokines (IL-1 , IL-2 and other colony stimulating factors) can also Induce significant host 
production of TNF. Soluble TNF-R compositions may therefore be used, for example, to treat cachexia or 
septic shock or to treat side effects assodated with cytokine therapy. Because of the primary roles IL-1 and 
IL-2 play in the production of TNF, combination ttierapy using both IL-1 receptors or IL-2 receptors may be 
prefenred in the treatment of TNF-associated clinical indications. 

40 The following examples are offered by way of illustration, and not by way of limitation. 



EXAMPLES 

46 

Example 1 



50 

Binding Assays 



A, Rauloi'ibermg of TNFa and TNF0. Recombinant human TNFo, in the form of a fusion protein 
55 containing a hydrophilic octapeptide al the N-terminus, was expressed in yeast as a secreted protein and 
purified by affinity chromatography (Hopp et al., Bio/ Technology 6:1204, 1988). Purified recombinant 
human TNF^ was purchased from R&D Systems (Minneapolis, MN). Both proteins were radiolabeled 
using the commerdally available solid phase agent, lODOGEN (Pierce), In tiiis procedure, 5 ug of 
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lODO-GEN were plated at the bottom of a 10 x 75 mm glass tube and incubated for 20 minutes at 4* C 
with 75 ul of 0.1 M sodium phosphate, pH 7.4 and 20 ul (2 mCi) Na '^l This solution was then 
transferred to a second glass tube containing 5 ug TNFa (or TNF/3) in 45 ul PBS for 20 minutes at 4* C. 
The reaction mixture was fractionated by gel filtration on a 2 ml bed volume of Sephadex Q-25 (Sigma) 

5 equilibrated in Roswell Park Memorial Institute (RPMI) 1 640 medium containing 2.5% (w/v) bovine 
senjm albumin (BSA), 0.2% (w/v) sodium azide and 20 mM Hopes pH 7.4 (binding medium). The final 
pool of ^^l-TNF was diluted to a working stock solution of 1 x 10"' M In binding medium and stored for 
up to one month at 4'C without detectable loss of receptor binding activity. The specific activity is 
routinely 1x10* cpm/mmole TNF. 

10 B. Binding to Intact Cells. Binding assays with intact cells were performed by two methods. In tiie first 
method, cells were first grown either in suspension (e.g.. U 937) or by adherence on tissue culture plates 
(e.g., W126-VA4. COS ceils expressing the recombinant TNF receptor). Adherent cells were subsequentiy 
removed by treatment with 5mM EDTA treatment for ten minutes at 37 degrees centigrade. Binding 
assays were then performed by a pthalate oil separation metiiod (Dower et al.. J, Immunol, /32:751, 

15 1984) essentially as described by Park et al. (J. Biol. Chem. 2BVA\T7, 1988). Non-specific binding of 
'25HNF was measured in the presence of a 200-fold or greater molar excess of unlabeled TNF. Sodium 
azide (0.2%) was included in a binding assay to inhibit internalization of ^^si-TNF by cells. In the second 
method, COS cells transfected with the TNF-R-containing plasmid, and expressing TNF receptors on the 
surface, were tested for the ability to bind '^si-iNF by the plate binding assay described by Sims et al. 

20 {Science 24 ;:585. 1 988). 

C. Solid Phase Binding Assays. The ability of TNF-R to be stably adsorbed to nitix>cellulose from 
detergent extracts of human cells yet retain TNF-binding activity provided a means of detecting TNF-R. 
Celt extracts were prepared by mixing a cell pellet with a 2 x volume of PBS containing 1% Triton X-100 
and a cocktail of protease Inhibitors (2 mM phenylmethyl sulfonyl fluoride. 10 uM pepstatin. 10 uM 

25 leupeptin. 2 mM o-phenanttiroline and 2 mM EGTA) by vigorous vortexing. The mixture was Incubated 
on ice for 30 minutes after which it was centrifuged at I2.000x g for15 minutes at 8* C to remove nuclei 
and other debris. Two microliter aiiquots of cell extracts were placed on dry BA85/21 nitrocellulose 
membranes (Schleicher and Schuell. Keene, NH) and allowed to dry. The membranes were incubated in 
tissue culture dishes for 30 minutes in Tris (0.05 M) buffered saline (0.15 M) pH 7.5 containing 3% w/v 

» BSA to block nonspecific binding sites. The membrane was then covered with 5 x 10"^^ M '^^i-xmp in 
PBS + 3% BSA and incubated for 2 hr at 4* C witfi shaking. At tiie end of ttiis time, die membranes 
were washed 3 times in PBS, dried and placed on Kodak X-Omat AR film for 18 hr at -70* C. 

Example 2 

35 



Isolation of Human TNF-R cDNA by Direct Expression of Active Protein in COS-7 Cells 



40 

Various human cell lines were screened for expression of TNF-R based on ttieir ability to bind '^sj. 
labeled TNF. The human fibroblast cell line Wl-2e VA4 was found to express a reasonable number of 
receptors per cell. Equilibrium binding studies showed that m cell line exhibited biphasic binding of '^i- 
TNF witti approximately 4.000 high affinity sites (Ka = 1 x 10^° M"*) and 15.00 low affinity sites (K. = l x 
45 10^ M-') per cell. 

An unsized cDNA library was constructed by reverse transcription of poiyadenylated mRNA isolated 
from total RNA extracted from human fibroblast WI-26 VA4 cells grown in the presence of pokeweed 
mitogen using standard techniques (Gubler. et al., Gene 25:263. 1983: Ausubel et al.. eds., Current 
Protocols In Molecular Biology, Vol. 1, 1987). The cells were harvested by lysing tiie cells in a guanidine 

50 hydrochloride solution and total RNA isolated as previously described (March et ai.. Nature 3/5:641 , 1985). 
Poly A* RNA was isolated by oligo dT cellulose chromatography and double-stranded cDNA was 
prepared by a metiiod similar to that of Gubler and Hoffman (Gene 25:263. 1983). Briefly, tiie poly A* RNA 
was converted to an RNA-c.DNA hybrid by reverse ti-anscriptase using oligo dT as a primer. The RNA-cONA 
hybrid was tt^en converted into double-sti-anded cDNA using RNAase H in combination witii DNA poly- 

55 merase I. The resulting double stranded cDNA was blunt-ended with T4 DNA polymerase. To the blunt- 
ended cDNA is added fcoffl linker-adapters (having internal A/ofl sites) which were phosphorylated on only 
one end (Invitrogen). The linker-adaptered cDNA was treated witii T4 polynucleotide kinase to 
phosphorylate the S overhanging region of the linker-adapter and unligated linkers were removed by 
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running the cDNA over a Sepharose CL4B column. The linker-adaplered cDNA was ligated to an equimolar 
concentration of EcoR^ cut and dephosphorylated arms of bacteriophage Xgt10 (Huynh at al, DNA Cloning: 
A Practical Approach, Glover, ed.. IRL Press, pp. 4d-78). The ligated DNA was packaged into phage 
particles using a commercially available kit to generate a library of recombinants (Stratagene Cloning 

s Systems, San Diego. OA, USA). Recombinants were further amplified by plating phage on a bacterial lawn 
of £ coll strain c600(hfl"). 

Phage DNA was purified from the resulting XgtIO cDf^ library and the cDNA inserts excised by 
digestion with the restriction enzyme Not\ . Following electrophoresis of the digest through an agarose gel, 
cDNAs greater than 2,000 bp were isolated. 

10 The resulting cDNAs were ligated into the eukaryotic expression vector pCAV/NOT, which was 
designed to express cDNA sequences inserted at its multiple cloning site when transfected into mammalian 
cells. pCAV/NOT was assembled from pDC201 (a derivative of pMLSV, previously described by Cosman et 
al„ Nature 312: 768, 1984), SV40 and cytomegalovirus DNA and comprises, In sequence with the direction 
of transcription from the origin of replication: (1) SV40 sequences from coordinates 5171-270 Including the 

75 origin of replication, enhancer sequences and early and late promoters; (2) cytomegalovirus sequences 
including the promoter and enhancer regions (nucleotides 671 to -t-SS from the sequence published by 
Boechart et al. (Cell 4f:521, 1985); (3) adenovirus-2 sequences containing the first exon and part of the 
intron between the first and second exons of the tripartite leader, the second exon and part of the third exon 
of the tripartite leader and a multiple cloning site (MCS) containing sites for Xhol . Kpnl . Smal . Noti and 

20 Bg/1; (4) SV40 sequences from coordinates 4127-4100 and 2770-2533 that IrKlude the polyadenylation and 
termination signals for early transcription; (5) sequences derived from pBR322 and virus-associated 
sequences VAI and VAII of pDC201, with adenovirus sequences 10532-11156 containing the VAI and VAll 
genes, followed by pBR322 sequences from 4363-2486 and 1094-375 containing the ampicillin resistance 
gene and origin of replication. 

25 The resulting WI-26 VA4 cDNA library in pCAV/NOT was used to transform £ coll strain DH5a, and 
recombinants were plated to provide approximately 800 colonies per plate and sufficient plates to provide 
approximately 50.000 total colonies per screen. Colonies were scraped from each plate, pooled, and 
plasmid DNA prepared from each pool. The pooled DNA was then used to transfect a sub-confluent layer of 
monkey COS-7 cells using DEAE-dextran followed by chloroquine treatment, as described by Luthman et 

30 al, {NucL Acids Res, rr:l295, 1983) and McCutchan et al. (J, Natl. Cancer Inst 4/:351, 1988). The cells 
were then grown In culture for three days to permit transient expression of the inserted sequences. After 
three days, cell culture supematants were discarded and the cell monolayers in each plate assayed for TNF 
binding as follows. Three ml of binding medium containing 1,2 x 10~" M '"[-labeled F1-AG»-TNF was 
added to each plate and the plates incubated at 4*C for 120 minutes. This medium was then discarded, 

35 and each plate was washed once with cold binding medium (containing no labeled TNF) and twice with cold 
PBS. The edges of each plate were then broken off, leaving a flat disk which was contacted with X-ray film 
for 72 hours at -70 *C using an intensifying screen, TNF binding activity was visualized on the exposed 
films as a dark focus against a relatively uniform background. 

After approximately 240,000 recombinants the library had been screened in this manner, one transfec- 

40 tant pool was ob>served to provide TNF binding foci which were cleariy apparent against the background 
exposure. 

A frozen stock of bacteria from the positive poo! was then used to obtain plates of approximately 150 
colonies. Replicas of these plates were made on nitrocellulose filters, and the plates were then scraped and 
plasmid DNA prepared and transfected as described above to identify a positive plate. Bacteria from 

45 individual colonies from the nitrocellulose replica of this plate were grown in 0.2 ml cultures, which were 
used to obtain plasmid DNA, which was transfected into COS-7 cells as described above. In this manner, a 
single clone, clone 1, was isolated which was capable of inducing expression of human TNF-R in COS cells. 
The expresstof) vector pCAV/NOT containing the TNF-R cDNA clone 1 has been deposited with the 
American Type Culture Collection. 12301 Parklawn Drive, Rockville, MD 20852. USA (Accession No. 68088) 

50 under the name pCAV/NOT-TNF-R, on 6th Sept. 1989. 



Example 3 

55 

Construction of cDNAs Encoding Soluble huTNF-RA235 
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A cDNA encoding a soluble huTNF-RA235 (having the sequence of amino acids 1-235 of Figure 2A) 
was constructed by excising an 840 bp fragment from pCAV/NOT-TNF-R with the restriction enzymes Noti 
and Pvu2. Noti cuts at the multiple cloning site of pCAV/NOT-TNF-R and Pvu2 cuts within the TNF-R 
coding region 20 nucleotides s' of the transmembrane region. In order to reconstruct the 3 end of the TNF- 
R sequences, two oligonucleotides were synthesized and annealed to create the following oligonucleotide 
linker: 

Pvu2 BamHl Bgl2 

CTGAAGGGAGCACTGGCGACX^GGATCCA 
GACTTCCCTCGTGACCGCTGATTCCTAGGTCTAG 
AlaGluGlySerThrGlyA3p£xul 

This oligonucleotide linker has terminal Pvu2 and Bgl2 restriction sites, regenerates 20 nucleotides of the 
TNF-R, followed by a termination codon (underlined) and a BamHl restriction site (for convenience In 
isolating the entire soluble TNF-R by Not1 /BamHl digestion). This oligonucleotide was then ligated with the 
840 bp Not1/Pvu2 TNF-R insert into Bgl2/Not1 cut pCAV/NOT to yield psoihuTNF-RA235/CAVNOT, which 
was transfected into COS-7 cells as described above. This expression vector induced expression of soluble 
human TNF-R which was capable of binding TNF. 



Example 4 



Construction of cDNAs Encoding Soluble huTNF-RA185 



A cDNA encoding a soluble huTNF-RA185 (having the sequence of amino acids 1-185 of Rgure 2A) 
was constructed by excising a 640 bp fragment from pCAV/NOT-TNF-R with the restriction enzymes Noti 
and Bgi2. Not1 cuts at the multiple cloning site of pCAV/NO-TNF-R and Bgl2 cuts within the TNF-R coding 
region at nucleotide 637. which is 237 nucleotides s' of the transmembrane region. The following 
oligonucleotide linkers were synthesized: 

Bgl2 

5 ' -GATCTGTAACGTGGTGGCCATCCCTGGGAATGCAAGCATGGATGC-3 ' 
ACATTGCACCACCGGTAGGGACCCTTACGTTCG 
IleCysAfinValValAlalleProGlyAsnAlaSerMetAdpAla 



Noti 

5 • - AGrCTGCACGTCCACGXCCCCCACCCGGiaGC -3 * 

TACCTACGTCAGACGTGCAGGTGCAGGGGGTGGGCCACTCGCCGG 
ValCysThrSerThrSdrProThrArgEnd 

The above oligonucleotide linkers reconstruct the 3' end of the receptor molecule up to nucleotide 708. 
followed by a termination codon (underlined). These oligonucleotides were then ligated with the 640 bp 
Not1 TNF-R insert into Noti cut pCAV/NOT to yield the expression vector psolTNFRA185/CAVNOT. which 
was transfected into COS-7 cells as described at>ove. This expression vector induced expression of soluble 
human TNF-R which was capable of binding TNF. 



Example 5 



Construction of cDNAs Encoding Soluble huTNF-RAie3 
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A cDNA encxxJing a soluble huTNF-RA163 (having the sequence of amino adds 1-163 of Rgure 2A) 
was constructed by excising a 640 bp fragment from from pCAV/NOT-TNF-R with the restriction enzymes 
Noti and Bgl2 as described in Example 4. The following oligonucleotide linkers were synthesized: 

Bgl2 Not! 
5 • -GATCTGTTfiAGC - 3 ' 

ACAACTCGCCGG 
IleCysEnd 

This above oligonucleotide linker reconstructs the 3' end of the receptor molecule up to nucleotide 642 
(amino acid 163). followed by a termination codon (underlined). This oligonucleotide was then ligated with 
the 640 bp Noti TNF-R insert into Noti cut pCAV/NOT to yield the expression vector 
psolTNFRAie3/CAVNOT. which was transfected into COS-7 cells as described above. This expression 
vector induced expression of soluble human TNF-R which was capable of binding TNF in the binding assay 
described in Example 1 . 



Example 6 



Construction of cDNAs Encoding Soluble huTNF-RA142 



A cDNA encoding a soluble huTNF-RAl42 (having the sequence of amino acids 1-142 of Figure 2A) 
was constructed by excising a 550 bp fragment from from pCAV/NOT-TNF-R with the restriction enzymes 
Not1 and AlwNI. AiwN1 cuts within the TNF-R coding region at nucleotide 549. The following 
oligonucleotide linker was synthesized: 

Bgl2 Noti 
5 • -CTGAAACATCAGACGTGGTGTGCAAGCCCTGT2aAA-3 ' 
CTTGACTTTGTAGTCTGCACCACACGTTCGGGACAATTTCTAGA 

End 

This above oligonucleotide linker reconstructs the 3' end of the receptor molecule up to nucleotide 579 
(amino acid 142), followed by a termination codon (underlined). This oligonucleotide was then ligated with 
the 550 bp Notl/AiwNI TNF-R insert into Not1/Bgl2 cut pCAV/NOT to yieW the expression vector 
psoiTNFRA142.'CAVNOT, which was transfected into COS-7 cells as described above. This expression 
vector did not induced expression of soluble human TNF-R which .was capable of binding TNF. It is 
believed that this particular construct failed to express biologically active TNF-R because one or more 
essential cysteine residue (e.g., Cys'^' or Cys^^^) required for intramolecular bonding (for formation of the 
proper tertiary structure of the TNF-R molecule) was eliminated. 



Example 7 



Expression of Soluble TNF Receptors in CHO Cells 



Soluble TNF receptor was expressed in Chinese Hamster Ovary (CHO) cells using the glutamlne- 
synthetase (GS) gene amplification system, substantially as described in POT patent application Nos. 
WO87/04462 and WOd9/01036. Briefly, CHO cells are transfected with an expression vector containing 
genes for both TNF-R and GS. CHO cells are selected for GS gene expression based on the ability of the 
transfected DNA to confer resistance to low levels of methionine sulphoximine (MSX). GS sequence 
amplification events in such cells are selected using elevated MSX concentrations in this way, contiguous 
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TNF-R sequences are also amplified and enhanced TNF-R expression is achieved. 

The vector used in the GS expression system was psolTNFR/P6/PSVLGS, which was constructed as 
follows. Rrst. the vector pSVLGS.1 (described in PCT Application Nos. WO87/044e2 and WO89/0103e, and 
available from Celltech. Ltd., Berkshire, UK) was cut with the Bam HI restriction enzyme and de- 

s phosphorylated with caif Intestinal alkaline phosphatase (CIAP) to prevent the vector from religating to itself. 
The BamH1 cut pSVLGS.I fragment was then Hgated to a 2.4 kb BamHI to Bgl2 fragment of pEEShCMV 
(described in PCT Application No, WO89/01036. also available from Celltech) which was cut with Bgl2. 
BamHI and Fspl to avoid two fragments of similar size, to yield an 11.2 kb vector designated 
p6/PSVLQS.1. pSVLGS.1 contains the glutamine synthetase selectable marker gene under control of the 

10 SV40 later promoter. The BamHI to Bgl2 fragment of pEEBhCMV contains the human cytomegalovirus 
major immediate early promoter (hCMV), a polylinker, and tiie SV40 early polyadenylation signal. The 
coding sequences for soluble TNF-R were added to p6/PSVLQS.1 by excising a Noti to BamHI fragment 
from the expression vector psolTNFR/CAVNOT (made according to Example 3 above), blunt ending with 
Klenow and ligating with Smal cut dephosphorylated pB/PSVLGS.I, thereby placing ttie solTNF-R coding 

15 sequences under the control of the hCMV promoter. This resulted in a single plasmid vector in which the 
SV40/GS and hCMB/solTNF-R transcription units are transcribed in opposite directions. This vector was 
designated psolTNFR/P6/PSVLGS. 

psolTNFR/Pe/PSVLGS was used to transfect CH0-K1 cells (available from ATCC. Rochville, MD, under 
accession number CCL 61) as follows. A monolayer of CH0-K1 cells were grown to subconfluency in 

20 Minimum Essential Medium (MEM) 10X (Gibco: 330-1581AJ) without glutamine and supplemented with 
10% dialysed fetal bovine serum (Gibco: 220-6300AJ). 1 mM sodium pyruvate (Sigma), MEM non-essential 
amino adds (Gibco: 320-11 40AQ), 500 uM asparagine and glutamate (Sigma) and nucleosides (30 uM 
adenosine, guanosine. cytidine and uridine and 10 uM thymidlne)(Slgma). 

Approximately 1 x 10« cells per 10 cm petri dish were transfected witii 10 ug of psoiTNFR/P6/PSVLGS 

25 by standard calcium phosphate precipitation, substantially as described by Graham & van der Eb. Vfroiogy 
52:456 (1983). Cells were subjected to glycerol shock (15% glycerol in serum-free culture medium for 
approximately 1 .5 minutes) approximately 4 hours after transfection, substantially as described by Frost & 
Wilfiams. Virology 91:39 (1978). and then washed with serum-free medium. One day later, transfected cells 
were fed with fresh selective medium containing MSX at a final concentration of 25 uM. Colonies of MSX- 

30 resistant surviving cells were visible within 3-4 weeks. Surviving colonies were transfenred to 24-well plates 
and allowed to grow to confluency in selective medium. Conditioned medium from confluent wells were 
then assayed for soluble TNF-R activity using ttie binding assay described in Example 1 above. These 
assays indicated that the colonies expressed biologically active soluble TNF-R. 

In order to select for GS gene amplification, several MSX-resislant cell lines are transfected with 

J5 psolTNFR/PB/PSVLGS and grown in various concentrations of MSX. For each cell line, approximately 1x10^ 
cells are plated in gradually increasing concentrations of 100 uM. 250 uM, 500 uM and 1 mM MSX and 
incubated for 10-14 days. After 12 days, colonies resistant to the higher levels of MSX appear. The 
surviving colonies are assayed for TNF-R activity using ttie binding assay described above in Example 1 . 
Each of these highly resistant cell lines contains cells which arise from muit.; independent amplification 

40 events. From these cells lines, one or more of the most highly resistant cells lines are isolated. The 
amplified cells with high production rates are then cloned by limiting dilution cloning. Mass cell cultures of 
the transfectants secrete active soluble TNF-R. 



45 Example 8 



Expression of Soluble Human TNF-R in Yeast 

50 

Soluble human TNF-R was expressed in yeast witii the expression vector plXY432. which was derived 
from ttie yeast expression vector plXY120 and plasmid pYEP352. plXY120 is identical to pYaHuGM (ATCC 
53157), except tiiat it contains no cDNA Insert and includes a polylinker/multiple cloning site witii a Nco1 
55 resti'iction site. 

A DNA fragment encoding TNF receptor and suitable for cloning into ttie yeast expression vector 
plXY120 was first generated by polymerase chain reaction (PCR) amplification of ttie extracellular portion of 
ttie full length receptor from pCAV/NOT-TNF-R (ATCC 68088). The following primers were used in ttiis PCR 
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amplification: 

yend Primer 

5 5 ' -TTCCGGTACCTTTGGATAAAAGAGACTACAAGGAC 

Aap7 1 8 ->ProLeuAspLy sArgAapTyrLysAsp 

GACGATGACAAGTTGCCCGCCCAGGTGGCATTTACA-3 • 
AapA3pA3pLy3< r TNF-R > 

10 

y End Primer rantisense^ 

5 ' -CCCGGGATCCIl^GTCGCCAGTGCTCCCTTCAGCTGGG-3 ' 
BainHl>End< TNF-R > 

75 

The 5' end oligonucleotide primer used in the amplification included an Asp718 restriction site at its s' end. 
followed by nucleotides encoding the 3' end of the yeast a-f actor leader sequence (Pro-Leu- Asp-Lys-Arg) 
and those encoding the 8 amino acids of the FLAG® peptide (AspTyrLysAspAspAspAspLys) fused to 

20 sequence encoding the 5' end of the mature receptor. The FLAG® peptide (Hopp et al.. Bio/Technology 
6:1204, 1988) is a highly antigenic sequence which reversibly binds the monoclonal antibody Ml (ATCC HB 
9259). The oligonucleotide used to generate the 3' end of the PCR-derived fragment is the antfsense 
strand of DNA encoding sequences which tenmtnate the open reading frame of the receptor after nucleotide 
704 of the mature coding region (following the Asp residue preceding the transmembrane domain) by 

2S introducing a TAA stop codon (underlined). The stop codon is then followed by a BamH1 restriction site. 
The DNA sequences encoding TNF-R are then amplified by PGR, substantially as described by Innis et al.. 
eds., PCR Protocols: A Guide to Methods and Applications (Academic Press, 1990). 

The PCR-derived DNA fragment encoding soluble human TNF-R was subcloned into the yeast 
expression vector plXY120 by digesting the PCR-derived DNA fragment with BamHI and Asp718 restriction 

30 enzymes, digesting plXY120 with BamHI and Asp718, and ligating the PCR fragment into the cut vector in 
vitro with T4 DNA ligase. The resulting construction (plXY424) fused the open reading frame of the FLAG»- 
soiuble TNF receptor in-frame to the complete o-factor leader sequence and placed expression in yeast 
under the aegis of the regulated yeast alcohol dehydrogenase (ADH2) promoter, identity of tiie nucleotide 
sequence of the soluble TNF receptor canried in plXY424 witti those in cDNA clone 1 were verified by DNA 

35 sequencing using die dideoxynucleotide chain termination method. plXY424 was then transformed Into £ 
coll strain RR1. 

Soluble human TNF receptor was also expressed and secreted in yeast in a second vector. This 
second vector was generated by recovering the plXY424 plasmid from £ coli and digesting with EcoRI 
and BamHI restriction enzynnes to isolate tfie fragment spanning tiie region encoding the A0H2 promoter, 

40 tiie a-factor leader, the FLAG^-soluble TNF receptor and the stop codon. This fragment was ligated In vitro 
Into EcoRI and BamHI cut plasmid pYEP352 (HitI et al., Yeast 2:163 (1986)), to yield the expression 
plasmid plXY432, which was transformed into £.co// strain RR1. 

To assess secretion of tiie soluble human TNF receptor from yeast plXY424 was purified and 
introduced into a diploid yeast strain of S. cerevlslae (XV2 181) by electroporation and selection for 

45 acquisition of Uie plasmld-borne yeast TRP1 * gene on media lacking tryptophan. To assess secretion of the 
receptor directed by plXY4d2, tiie plasmid was introduced into the yeast strain PB149-6b by electroporation 
followed by selection for tiie plasmid-bome URA3* gene with growth on media lacking uracil. Ovemight 
cultures were grown at 30* C in the appropriate selective media. The PB148-6b/plXY434 transformants were 
diluted Into YEP-1% glucose media and grown at 30 *C for 38-40 hours. Supernatants were prepared by 

50 removal of cells by centrifugation, and filtration of supernatants through 0.45u filters. 

The level of secreted receptor in tiie supernatants was determined by immuno-dotidlot. Briefly, 1 ul of 
supernatants, and dilutions of tiie supernatants, were spotted onto nitrocellulose filters and allowed to dry. 
After blocking non-spea'fic protein binding with a 3% 6SA solution, the filters were incubated witii diluted 
Ml anti-FLAG«> antibody, excess antitxidy was removed by washing and then dilutions of horseradish 

55 peroxidase conjugated anti-mouse IgG antibodies were incubated with tiie filters. After removal of excess 
secondary antibodies, peroxidase substrates were added and color development was allowed to proceed for 
approximately 10 minutes prior to removal of tiie substrate solution. 

The anti-FLAQ9 reactive material found in the supernatants demonstrated tiiat significant levels of 
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receptor were secreted by both expression systems. Comparisons demonstrated that the plXY432 system 
secreted approximately 8-16 times more soluble human TNF receptor than the piXY424 system. The 
supernatants were assayed for soluble TNF-H activity, as described in Example 1. by their ability to bind 
i2S|.jfgp„ and block TNFa binding. The plXY4C2 supernatants were found to contain significant levels of 
5 active soluble TNF-R. 



Example 9 

10 

Isolation of Murine TNF-R cDNAs 



IS Murine TNF-R cDNAs were isolated from a cDNA library made from murine 7B9 cells, an antigen- 
dependent helper T cell line derived from C57BL/6 mice, by cross-species hybridization with a human TNF- 
R probe. The cDNA library was constructed in XZAP (Stratagene. San Diego), substantially as described 
above in Example 2. by isolating polyadenylated RNA from the 7B9 cells. 

A doubie-stranded human TNF-R cDNA probe was produced by excising an approximately 3.5 kb Not1 

20 fragment of the human TNF-R clone 1 and ^sp.iabeling the cDNA using random primers (Boehringer- 
Mannheim). 

The murine cDNA library was amplified once and a total of 900.000 plaques were screened, substan- 
tially as described in Example 2. with the human TNF-R cDNA probe. Approximately 21 positive plaques 
were purified, and the Bluescript plasmids containing EcoRI-linkered inserts were excised (Stratagene. San 
25 Diego). Nucleic acid sequencing of a portion of murine TNF-R clone 1 1 indicated that the coding sequence 
of the murine TNF-R was approximately 88% homologous to the corresponding nucleotide sequence of 
human TNF-R. A partial nucleotide sequence of murine TNF-R cDNA clone 1 1 is set forth in Figures 3A-3B. 

30 Example 10 



Preparation of Monoclonal Antibodies to TNF-R 

Preparations of purified recombinant TNF-R, for example, human TNF-R, or transfected COS cells 
expressing high levels of TNF-R are employed to generate monoclonal antibodies against TNF-R using 
conventional techniques, for example, those disclosed in U.S. Patent 4.411.993. Such antibodies likely to be 
40 useful in interfering with TNF binding to TNF receptors, for example, in ameliorating toxic or other 
undesired effects of TNF. or as components of diagnostic or research assays for TNF or soluble TNF 
receptor. 

To immunize mice, TNF-R immunogen is emulsified in complete Freund's adjuvant and Injected in 
amounts ranging from 10-100 ug subcutaneously into Balb/c mice. Ten to twelve days later, the immunized 

45 animals are boosted with additional immunogen emulsified in incomplete Freund's adjuvant and periodically 
boosted thereafter on a weekly to biweekly immunization schedule. Serum samples are periodically taken 
by retro-orWtal bleeding or tail-tip excision for testing by dot-blot assay (antibody sandwich) or ELISA 
(enzyme-linked immunosorbent assay). Other assay procedures are also suitable. Following detection of an 
appropriate antibody titer, positive animals are given an Intravenous injection of antigen in saline. Three to 

50 four days later, the animals are sacrificed, splenocytes harvested, and fused to the murine myeloma cell 
line NS1. Hybridoma cell lines generated by this procedure are plated in multiple microtiter plates in a HAT 
selective medium (hypoxanthine, aminopterin, and thymidine) to inhibit proliferation of non-fused cells, 
myeloma hybrids, and spleen cell hybrids. 

Hybridoma clones thus generated can be screened by ELISA for reactivity with "i NF R, for example, by 

55 adaptations of the techniques disclosed by Engvall et aJ.. Immunochem, 5:871 (1971) and in U.S. Patent 
4,703,004. Positive clones are then inlected into the peritoneal cavities of syngeneic Balb/c mice to produce 
ascites containing high concentrations (>1 mg/ml) of anti-TNF-R monoclonal antibody. The resulting 
monoclonal antibody can be purified by ammonium sulfate precipitation followed by gel exclusion 
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chromatography, and/or affinity chromatography based on binding of antibody to Protein A of Staphylococ- 
cus aureus. 



5 Claims 

1 . An isolated DNA sequence encoding a biologically active mammalian INF receptor (TNF-R) protein. 

2. An isolated ON A sequence according to claim 1 , selected from the group consisting of: 

(a) cDNA clones having a nucleotide sequence derived from the coding region of a native mammalian 
70 TNF-R gene; 

(b) DNA sequences capable of hybridization to the clones of (a) under moderately stringent conditions 
(50* C. 2 X SSC) and which encode biologically active TNF-R protein; and 

(c) DNA sequences which are degenerate as a result of the genetic code to the DNA sequences defined 
in (a) and (b) and which encode biologically active TNF*R protein. 

IS 3. An isolated DNA sequence according to claim 1 which encodes a soluble human TNF-R protein. 

4. An isolated DNA sequence according to claim 3, wherein the soluble human TNF-R protein has an amino 
acid sequence comprises the sequence of amino acid residues 1-x of Rgure 2A, wherein x is selected from 
the group consisting of amino acids 1 63-235 

5. An isolated DNA sequence according to claim 3, wherein the soluble human TNF-R protein comprises 
20 the sequence of amino acids 1-235 of Rgure 2A. 

6. A DNA sequence according to claim 5, wherein amino acid residue 46 Is selected from the group 
consisting of lie and Thr and amino acid residue 118 is selected from the group consisting of Val and He. 

7. An isolated DNA sequence according to claim 3. wherein the soluble human TNF-R protein comprises 
the sequence of amino acids M85 of Figure 2A. 

25 8. An isolated DNA sequence according to claim 3, wherein the soluble human TNF-R protein comprises 
the sequence of amino acids 1-163 of Figure 2A 

9. A recombinant expression vector comprising a DNA sequence according to any one of claims 1-8. 

10. A process for preparing a biologically active mammalian TNF receptor (TNF-R) protein, comprising 
cuUuring a suitable host cell comprising a vector according to claim 8 under conditions promoting 

30 expression. 

11. A purified biologically active mammalian TNF receptor (TNF-R) protein. 

12. A purified biologically active soluble human TNF-R protein. 

13. A purified biologically active TNF-R protein according to claim 12, comprising the sequence of amino 
acid residues 1 -235 of Figure 2A. 

35 14. A purified biologically active TNF-R protein according to claim 12, comprising the sequence of amino 
acid residues 1-185 of Figure 2A. 

15. A purified biologically active TNF-R protein according to claim 12, comprising the sequence of amino 
acid residues 1-163 of Rgure 2A. 

16. The use of a mammalian TNF-R protein in preparing a medicament for regulating immune responses in 
40 mammals. 

17. The method of claim 16, wherein the TNF-R protein is human TNF-R and the mammal to be treated is a 
human. 

18. The use of mammalian TNF-R protein in preparing a pharmaceutical composition suitable for parenteral 
administration to a human patient for regulating immune responses. 

45 19. A process for detecting TNF or TNF-R molecules or the interaction thereof, comprising use of a 
mammalian TNF receptor protein, a soluble TNF receptor protein capable of binding TNF or substantially 
similar TNF-R analog produced by recombinant ceil culture. 
20. Antibodies immunoreactive with mammalian TNF receptors. 

50 Claims for the following Contracting Slate: ES 

1. A process for preparing a purified mammalian TNF receptor (TNF-R) protein, the process comprising 
coupling together successive amino acid residues by the formation of peptide bonds to fomn a TNF-R 
polypeptide. 

55 2. A process according to claim 1 , wherein the TNF-R protein is a soluble human TNF-R protein. 

3. A process according to claim 2, wherein the soluble TNF-R protein has an amino acid sequence 
comprising the sequence of amino acid residues 1-x of Figure 2A. wherein x is selected from the group 
consisting of amino acids 163-235. 
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4. A process according to claim 3, wherein the soluble TNF-R protein has an amio acid sequence which 
comprises the sequence of amino acid residues 1 -235 of Figure 2A. 

5. A process according to claim 3, wherein the soluble TNF-R protein has an amio acid sequence which 
comprises the sequerxse of amino acid residues 1-185 of Figure 2A. 

5 6. A process according to claim 3, wherein the soluble TNF-R protein has an amio acid sequence which 
comprises the sequence of amino acid residues 1-163 of Figure 2A. 

7. The use of a mammalian TNF-R protein in preparing a medicament for regulating immune responses In 
mammals. 

a. The use of a mammalian TNF-R protein in preparing a pharmaceutical composition suitable for parenteral 
70 administration to a human patient for regulating immune responses. 

9. A process for preparing a DNA sequence encoding a mammalian TNF receptor (TNF-R) protein, the 
process comprising coupling together successive nucleotide residues. 

10. A process for preparing a DNA sequence according to claim 9. wherein the DNA sequence encodes a 
soluble human TNF-R protein. 

15 11. A process for preparing a DNA sequence according to claim 10, wherein the DNA sequence encodes a 
soluble TNF-R protein having an amino acid sequence comprising the sequence of amino acid residues 1-x 
of Figure 2A, wherein x is selected from the group consisting of amino acids 163-235. 

12. A process for preparing a DNA sequence according to claim 10, wherein the DNA sequence encodes a 
soluble TNF-R protein having an amio acid sequence which comprises the sequence of amino acid 

20 residues 1-235 of Rgure 2A. 

13. A process for preparing a DNA sequence according to claim 10, wfierein the Dl^iA sequence encodes a 
soluble TNF-R protein having an amio acid sequence which comprises the sequence of amino acid 
residues 1-185 of Figure 2A. 

14. A process for preparing a DNA sequence according to claim 10, wherein the DNA sequence encodes a 
26 soluble TNF-R protein having an amio acid sequence which comprises the sequence of amino acid 

residues 1-163 of Figure 2A. 

15. A process for preparing a DNA sequence according to claim 9, said DNA being selected from the group 
consisting of: 

(a) cDNA clones having a nucleotide sequence derived from the coding region of a native mammalian 
30 TNF-R gene; 

(b) DNA sequences capable of hybridization to the clones of (a) under moderately stringent conditions 
(50* C, 2 X SSC) and which encode biologically active TNF-R protein; and 

(c) DNA sequences which are degenerate as a result of the genetic code to the DNA sequences defined 
in (a) and (b) and which encode biologically active TNF-R protein. 

35 16. A process for preparing a DNA sequence according to claim 9, said DNA encoding a TNF-R protein 
having the sequence of amino acids of the TNF-R protein expressed by pCAV/NOT-TNF-R (ATCC 68088). 

17. A process for preparing a recombinant expression vector, comprising ligating bacterial, yeast or 
mammalian expression vector DNA and a DNA sequence encoding a human TNF-R protein sequence. 

18. A process for preparing a mammalian TNF-R or an analog thereof, comprising culturing a suitable host 
40 cell comprising a vector prepared according to claim 17 under conditions promoting expression. 

19. A process for detecting TNF or TNF-R protein molecules or the interaction thereof, comprising use of a 
mammalian TNF-R protein, a soluble TNF-R protein capable of binding TNF or substantially similar TNF-R 
analog produced by recombinant cell cuiture. 

20. A process for the preparation of antibodies immunoreactive with TNF receptor, the process comprising 
45 either (a) culturing a hybridoma cell expressing the antibodies and harvesting the antibodies, or (b) 

harvesting antibodies immunoreactive with TNF receptor from an appropriately immunised animal. 



Claims for the following Contracting State: GR 

50 1 . An isolated DNA sequence encoding a biologically active mammalian TNF receptor (TNF-R) protein. 

2. An isolated DNA sequence according to claim 1, selected from the group consisting of: 

(a) cDNA clones having a nucleotide sequence derived from the coding region of a native mammalian 
TNF-R gene; 

(b) DNA sequences capable of hybridization to the clones of (a) under moderately stringent conditions 
56 (50* C. 2 x SSC) and which encode biologically active TNF-R protein; and 

(c) DNA sequences which are degenerate as a result of the genetic code to the DNA sequences defined 
in (a) and (b) and which encode biologically active TNF-R protein. 

3. An isolated DNA sequence according to claim 1 which encodes a soluble human TNF-R protein. 
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4. An isolated DNA sequence according to claim 3, wherein the soluble human TNF-R protein has an amino 
acid sequence comprising the sequence of amino acid residues i-x of Figure 2A» wherein x is selected 
from the group consisting of amino acids 163-235 

5. An isolated DNA sequence according to claim 3, wherein the soluble human TNF-R protein comprises 
5 the sequence of amino adds 1 -235 of Rgure 2A. 

6. An isolated DNA sequence according to claim 3, wherein the soluble human TNF-R protein comprises 
the sequence of amino acids 1-185 of Rgure 2A. 

7. An isolated DNA sequence according to claim 3, wherein the soluble human TNF-R protein comprises 
the sequence of amino acids 1-163 of Rgure 2A. 

10 8. A DNA sequence according to claim 3. wherein amino acid residue 46 is selected from the group 
consisting of lie and Thr and amino acid residue 118 is selected from the group consisting of Val and He. 

9. A recombinant expression vector comprising a DNA sequence according to any one of claims 1 -7. 

10. A process for preparing a purified mammalian TNF receptor (TNF-R) protein, the process comprising 
coupling together successive amino acid residues by the formation of peptide bonds to form a TNF-R 

1$ polypeptide. 

11. A process according to claim 9, wherein the TNF-R protein is a soluble human TNF-R protein, 

12. A process according to claim 11. wherein the soluble human TNF-R protein has an amino acid 
sequence comprising the sequence of amino acid residues 1*x of Rgure 2A, wherein x is selected from the 
group consisting of amino acids 163-235. 

20 13. A process according to claim 11. wherein the soluble human TNF-R protein has an amio acid sequence 
which comprises the sequence of amino acid residues 1 -235 of Rgure 2A. 

14. A process according to claim 11. wherein the soluble human TNF-R protein has an amio acid sequence 
which comprises the sequence of amino acid residues 1 -1 85 of Figure 2A. 

15. A process according to claim 11, wherein the soluble human TNF-R protein has an amio acid sequence 
25 which comprises the sequence of amino acid residues 1-163 of Figure 2A. 

16. The use of a mammalian TNF-R protein in preparing a medicament for regulating immune responses in 
mammals. 

17. The use of a mammalian TNF-R protein in preparing a pharmaceutical composition suitable for 
parenteral admintstratjon to a human patient for regulating immune responses. 

30 18. Antibodies immunoreactive with mammalian TNF receptors. 



36 



40 



45 



50 



55 
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(2) INFORMATION FOR SEQ ID NO: I: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: I64X base pairs 

(B> TYPE: nuclaic acid 

(C) STBANDEDNBSS: single 

(D) TOPOIiOOT: lineai: 

<ii) MOLECULE TYPE: CDNA to mRNA 

<iii) HYPOTHETICAL: N 

(iv) ANTI-SENSE: N 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(G) CELL TYPE: Fibroblast 

(H) CELL LIN^: WI-26 VA4 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: WI-26 VA4 

(B) CLONE: 1 

(ix) FEATURE: 

(A) KAME/REY: CDS 

<B) LOCATION: 88.. 1473 

(D) OTHER INFORMATION: 
(ix) FEATURE: 

(A) NAME/KEY: mac_peptide 

<B) LOCATION: 88.. 1470 

<D) OTHER INFORMATION: 
(ix) FEATURE: 

(A) NAME/KEY: sigjpeptide 

<fi) LOCATION: 88. .153 

<D) OTHER INFORMATION: 

(X) PUBLICATION INFORMATION: 

(A) AUTHORS: Smith , Craig A. 

Davis, Terri 
Anderson, Dirk 
Solanti Lisabeth 
Beckniaonr M. P. 
Jerzy, Rita 
Dower, Stevoii 
Cosinan, David 
Goodwin, Raymond G. 

(B) TITLE: A Receptor for Tumor Necrosis Factor Defines an 

Unusual Family of Cellular and Viral Proteins 

(C) JOURNAL: Science 
<D) VOLUME: 246 

(F) PAGES: 1019-1023 

(G) DATE: 2S-MAY-ld90 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

GCGAGGCAGG CAGCCTGGAG AGAAGGCGCT GGGCTGCGAG GGCGCGAGGG CGCGAGGGCA 60 

GGGGGCAACC GGACCCCGCC CGCATCC ATG GCG CCC GTC GCC GTC TGG GCC 111 

Met Ala Pro Val Ala Val Trp Ala 
1 5 

GCG CTG GCC GTC GGA CTG GAG CTC TGG GCT GCG GCG CAC GCC TTG CCC 159 
Ala L«u Ala Val Gly Leu Glu Leu Trp Ala Ala Ala His Ala Leu Pro 
10 15 20 
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GCC CAG GTG GCA TTT ACA CCC TAC GCC CCG GAG CCC GGG AGC ACA TGC 
Ala Gin V«l Ala Phe Thr Pro Tyr Ala Pro Glu Pro Gly Sar Thr Cys 
25 30 35 40 

CGG CTC AGA GAA TAC TAT GAC CAG ACA GCT CAG ATG TGC TGC AGC AAA 
Arg Leu Arg Glu Tyr Tyr Aap Gin Thr Ala Gin Met Cya Cys Ser Lya 
45 50 55 

TGC TCG CCG GGC CAA CAT GCA AAA GTC TTC TGT ACC AAG ACC TCG GAC 
Cys Ser Pro Gly Gin His Ala Lya Val Phe Cys Thr Lys Thr Ser Asp 
60 65 70 

ACC GTG TGT GAC TCC TGT GAG GAC AGC ACA TAC ACC CAG CTC TGG AAC 
Thr Val Cys Asp Ser Cys Glu Asp Ser Thr Tyr Thr Gin Leu Trp Asn 
75 90 85 

TGG GTT CCC GAG TGC TTG AGC TGT GGC TCC CGC TGT AGC TCT GAC CAG 
Trp val Pro Glu Cys Leu Ser Cys Gly Ser Arg Cys Ser Ser Asp Gin 
90 95 100 

GTG GAA ACT CAA GCC TGC ACT CGG GAA CAG AAC CGC ATC TGC ACC TGC 
Val Glu Thr Gin Ala Cya Thr Arg Glu Gin Aan Arg He Cys Thr Cys 
105 110 115 120 

AGG CCC GGC TGG TAC TGC GCG CTG AGC AAG CAG GAG GGG TGC CGG CTG 
Arg Pro Gly Trp Tyr Cya Ala Leu Ser Lys Gin Glu Gly Cys Arg Leu 
125 130 135 

TGC GCG CCG CTG CGC AAG TGC CGC CCG GGC TTC GGC GTG GCC AGA CCA 
Cys Ala Pro Leu Arg Lys Cys Arg Pro Gly Phe Gly Val Ala Arg Pro 
140 145 150 

GGA ACT GAA ACA TCA GAC GTG GTG TGC AAG CCC TGT GCC CCG GGG ACG 
Gly Thr Glu Thr Sec Asp Vel Val Cya Lys Pro Cys Ala Pro Gly Thr 
155 160 165 

TTC TCC AAC ACG ACT TCA TCC ACG GAT ATT TGC AGG CCC CAC CAG ATC 
Phe Ser Asn Thr Thr Ser Ser Thr Asp He Cya Arg Pro Hia Gin He 

170 175 180 • 

TGT AAC GTG GTG GCC ATC CCT GGG AAT GCA AGC ATG GAT GCA GTC TGC 
Cya Asn Val Val Ala He Pro Gly Aan Ala Ser Met Asp Ala Val Cys 
185 190 195 200 

ACG TCC ACG TCC CCC ACC CGG AGT ATG GCC CCA GGG GCA GTA CAC TTA 
Thr Ser Thr Ser Pro Thr Arg Ser Met Ala Pro Gly Ala Val His Leu 
205 210 215 



CCC CAG CCA GTG TCC ACA CGA TCC CAA CAC ACG CAG CCA ACT CCA GAA 
Pro Gin Pro Val Ser Thr Arg Ser Gin His Thr Gin Pro Thr Pro Glu 
220 225 230 

CCC AGC ACT GCT CCA AGC ACC TCC TTC CTG CTC CCA ATG GGC CCC AGC 
Pro Ser Thr Ala Pro Ser Thr Ser Phe Leu Leu Pro Met Gly Pro Ser 
235 240 245 

CCC CCA GCT GAA GGG AGC ACT GGC GAC TTC GCT CTT CCA GTT GGA CTG 
Pro Pro Ala Glu Gly Ser Thr Gly Asp Phe Ala Leu Pro Val Gly Leu 
250 255 260 

ATT GTG GGT GTG ACA GCC TTG GGT CTA CTA ATA ATA GGA GTG GTG AAC 
He val Gly Val Thr Ala Leu Gly Leu Leu He He Gly Val Val Asn 
265 270 275 280 
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TGT GTC ATC ATG ACC CAG GTG AAA AAG AAG CCC TTG TGC CTG CAG AGA 975 
Cy3 Vai He Met Thr Gin Val Lys Lya Lys Pro Leu Cya Leu Gin Arg 
285 290 295 

GAA GCC AAG GTG CCT CAC TTG OCT GCC GAT AAG GCC CGG GGT ACA CAG 1023 
Glu Ala Lys Val Pro His Leu Pro Ala Asp Lys Ala Arg Gly Thr Gin 
300 305 310 

GGC CCC GAG CAG CAG CAC CTG CTG ATC ACA GCG CCG AGC TCC AGC AGC 1071 
Gly Pro Glu Gin Gin His Leu Leu Zle Thr Ala Pro Ser Ser Ser Ser 
315 320 325 

AGC TCC CTG GAG AGC TCG GCC AGT GCG TTG GAC AGA AGG GCG CCC ACT 1119 
Ser Ser Leu Glu Ser Ser Ala Ser Ala Leu Asp Arg Arg Ala Pro Thr 
330 335 340 

CGG AAC CAG CCA CAG GCA CCA GGC GTG GAG GCC AGT GGG GCC GGG GAG 1167 
Arg Asn Gin Pro Gin Ala Pro Gly Val Glu Ala Ser Gly Ala Gly Glu 
345 350 355 360 

GCC CGG GCC AGC ACC GGG AGC TCA GAT TCT TCC CCT GGT GGC CAT GGG 1215 
Ala Arg Ala Ser Thr Gly Ser Ser Asp Ser Ser Pro Gly Gly His Gly 
365 370 375 

ACC CAG GTC AAT GTC ACC TGC ATC GTG AAC GTC TGT AGC AGC TCT GAC 1263 
Thr Gin Val Asn Val Thr Cys He Val Asn Val Cys Ser Ser Ser Asp 
380 385 390 

CAC AGC TCA CAG TGC TCC TCC CAA GCC AGC TCC ACA ATG GGA GAC ACA 1311 
His Ser Ser Gin Cys Ser Ser Gin Ala Ser Ser Thr Met Gly Asp Thr 
395 400 405 

GAT TCC AGC CCC TCG GAG TCC CCG AAG GAC GAG CAG GTC CCC TTC TCC 1359 
Asp Ser Ser Pro Ser Glu Ser Pro Lys Asp Glu Gin Val Pro Phe Ser 
410 415 420 

AAG GAG GAA TGT GCC TTT CGG TCA CAG CTG GAG ACG CCA GAG ACC CTG 1407 
Lys Glu Glu Cys Ala Phe Arg Ser Gin Leu Glu Thr Pro Glu Thr Leu 
425 430 435 440 

CTG GGG AGC ACC GAA GAG AAG CCC CTG CCC CTT GGA GTG CC " GAT GCT 1455 
Leu Gly Ser Thr Glu Glu Lys Pro Leu Pro Leu Gly Val Pro Asp Ala 
445 450 455 

GGG ATG AAG CCC AGT TAA CCAGGCCGGT GTGGGCTGTG TCGTAGCCAA 1503 
Gly Met Lys Pro Ser . 
460 

GGTGGGCTGA GCCCTGGCAG GATGACCCTG CGAAGGGGCC CTGGTCCTTC CAGGCCCCCA 1563 

CCACTAG6AC TCTGAGGCTC TTTCTGGGCC AAGTTCCTCT AGTGCCCTCC ACAGCCGCAG 1623 

CCTCCCTCTG ACCTGCAG 2.641 



(2) INfFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 462 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Ala Pro Val Ala Val Trp Ala Ala Leu Ala Val Gly Leu Glu Leu 
15 10 15 

Trp' Ala Ala Ala His Ala Leu Pro Ala Gin Val Ala Phe Thr Pro Tyr 
20 25 30 

Ala Pro Glu Pro Gly Ser Thr Cys Arg Leu Arg Glu Tyr Tyr Aap Gin 
35 40 45 

Thr Ala Gin Met Cys Cys Ser Lys Cys Ser Pro Gly Gin His Ala Lys 
50 55 60 

Val Phe Cys Thr Lys Thr Ser Asp Thr Val Cys Asp Ser Cys Glu Asp 
65 70 75 80 

Ser Thr Tyr Thr Gin Leu Trp Asn Trp Val Pro Glu Cya Leu Ser Cys 
85 90 95 

Gly Ser Arg Cys Ser Ser Asp Gin Val Glu Thr Gin Ala Cys Thr Arg 
100 105 110 

Glu Gin Asn Arg He Cys Thr Cys Arg Pro Gly Trp Tyr Cys Ala Leu 
115 120 125 

Ser Lys Gin Glu Gly Cys Arg Leu Cys Ala Pro Leu Arg Lys Cys Arg 
130 135 140 

Pro Gly Phe Gly Val Ala Arg Pro Gly Thr Glu Thr Ser Asp Val Val 
145 150 155 160 

Cys Lys Pro Cys Ala Pro Gly Thr Phe Ser Asn Thr Thr Ser Ser Thr 
165 170 175 

Asp He Cys Arg Pro His Gin He Cys Asn Val Val Ala He Pro Gly 
180 185 190 

Asn Ala Ser Met Asp Ala Val Cys Thr Ser Thr Ser Pro Thr Arg Ser 
195 200 205 

Met Ala Pro Gly Ala Val His Leu Pro Gin Pro Val Ser Thr Arg Ser 
210 215 220 

Gin His Thr Gin Pro Thr Pro Glu Pro Ser Thr Ala Pro Ser Thr Ser 
225 230 235 240 

Phe Leu Leu Pro Met Gly Pro Ser Pro Pro Ala Glu Gly Ser Thr Gly 
245 250 255 

Asp Phe Ala Leu Pro Val Gly Leu He Val Gly Val Thr Ala Leu Gly 
260 265 270 

Leu Leu He He Gly Val Val Asn Cys Val He Met Thr Gin Val Lys 
275 280 285 

Lys Lys Pro Leu Cys Leu Gin Arg Glu Ala Lys Val Pro His Leu Pro 
290 295 300 
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Gin Gin His Leu L«u 
320 



Glu Ser Ser Ala Ser 
335 

Pro Gin Ala Pro Gly 
350 

Ser Thr Gly Ser Ser 
365 

Asn Val Thr Cys lie 
380 

Gin Cys Ser Ser Gin 
400 

Pro Ser Glu Ser Pro 
415 



Cys Ala Phe Arg Ser 

430 

Thr Glu Glu Lys Pro 
445 

Pro Ser 
460 

(2) INFORMATION FOR SEQ ID NO: 3: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3813 base pairs 

(B) TYPE: nucleic acid 
fC) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: N 

(iv) ANTI-SENSE: N 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: zitouse 

(B) STRAIN: C57BL/6 

(G) CELL TYPE: T-helper cell 

(H) CELL LINE: 7B9 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: 11 

(ix) FEATURE: 

(A) NAME/KEY: COS 

<B) LOCATION: 55.. 1479 

(D) OTHER INFORMATION: 
(ix) FEATURE: 

(A) NAME/KEY: inat_peptide 

(B) LOCATION: 55.. 1476 
(D) OTHER INFORMATION: 



Ala Asp Lys Ala Arg Gly Thr Gin Gly Pro Glu 
305 310 315 

lie Thr Ala Pro Ser Ser Ser Ser Ser Ser Leu 
325 330 

Ala Leu Asp Arg Arg Ala Pro Thr Arg Asn Gin 
340 345 

val Glu Ala Ser Gly Ala Gly Glu Ala Arg Ala 
355 360 

Asp Ser Ser Pro Gly Gly His Gly Thr Gin Val 
370 375 

Val Asn Val Cys Ser Ser Ser Asp His Ser Ser 
385 390 395 

Ala Ser Ser Thr Met Gly Asp Thr Asp Ser Ser 
405 410 

Lys Asp Glu Gin Val Pro Phe Ser Lys Glu Glu 
420 425 

Gin Leu Glu Thr Pro Glu Thr Leu Leu Gly Ser 
435 440 



Leu Pro Leu Gly Val Pro Asp Ala Gly Met Lys 
450 455 
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(ix) FEATORE: 

(A) MAMB/KEY: 3ig_j>eptide 

(B) LOCATION: 55.. 120 

(D> OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

CGCAGCTGAG GCACTAGAGC TCCAGGCACA AGGGCGGGAG CCACCGCTGC CCCT ATG 57 

Met 
1 

GCG CCC GCC GCC CTC TGG GTC GCG CTG GTC TTC GAA CTG CAG CTG TGG 105 
Ala Pro Ala Ala Leu Trp Val Ala Leu Val Phe Glu Leu Gin Leu Trp 
5 10 15 

GCC ACC GGG CAC ACA GTG CCC GCC CAG GTT GTC TTG ACA CCC TAC AAA 153 
Ala Thr Gly His Thr Val Pro Ala Gin Val Val Leu Thr Pro Tyr Lys 
20 25 30 

CCG GAA CCT GGG TAC GAG TGC CAG ATC TCA CAG GAA TAC TAT GAC AGG 201 
Pro Glu Pro Gly Tyr Glu Cya Gin lie Ser Gin Glu Tyr Tyr Asp Arg 
35 40 45 

AAG GCT CAG ATG TGC TGT GCT AAG TGT CCT CCT GGC CAA TAT GTG AAA 249 
Lys Ala Gin Met Cys Cys Ala Lys Cys Pro Pro Gly Gin Tyr Val Lys 
50 55 €0 65 

CAT TTC TGC AAC AAG ACC TCG GAC ACC GTG TGT GCG GAC TGT GAG GCA 297 
His Phe cya Asn Lys Thr Ser Aap Thr Val Cys Ala Asp Cya Glu Ala 
70 75 80 

AGC ATG TAT ACC CAG GTC TGG AAC CAG TTT CGT ACA TGT TTG AGC TGC 345 
Ser Met Tyr Thr Gin Val Trp Asn Gin Phe Arg Thr Cys Leu Ser Cys 
85 90 95 

AGT TCT TCC TGT ACC ACT GAC CAG GTG GAG ATC CGC GCC TGC ACT AAA 393 
Ser Ser Ser Cys Thr Thr Aap Gin Val Glu lie Arg Ala Cys Thr Lys 
100 105 110 

CAG CAG AAC CGA GTG TGT GCT TGC GAA GCT GGC AGG TAC TGC GCC TTG 441 
Gin Gin Asn Arg Val Cya Ala Cys Glu Ala Gly Arg Tyr Cys Ala Leu 
115 120 125 

AAA ACC CAT TCT GGC AGC TGT CGA CAG TGC ATG AGG CTG AGC AAG TGC 489 
Lys Thr His Ser Gly Ser Cys Arg Gla Cys Met Arg Leu Ser Lys Cys 
130 135 140 145 

GGC CCT GGC TTC GGA GTG GCC AGT TCA AGA GCC CCA AAT GGA AAT GTG 537 
Gly Pro Gly Phe Gly Val Ala Ser Ser Arg Ala Pro Aan Gly Asn val 
150 155 160 

CTA TGC AAG GCC TGT GCC CCA GGG ACG TTC TCT GAC ACC ACA TCA TCC 585 
Leu Cys Lya Ala Cys Ala Pro Gly Thr Phe Ser Asp Thr Thr Ser Ser 
165 170 175 

ACT GAT GTG TGC AGG CCC CAC CGC ATC TGT AGC ATC CTG GCT ATT CCC 633 
Thr Asp Val Cys Arg Pro His Arg lie Cya Ser He Leu Ala He Pro 
180 185 190 
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GGA AAT GCA AGC ACA GAT GCA GTC TGT GCG CCC GAG TCC CCA ACT CTA 681 
Gly Ash Ala Ser Thr Asp Ala Val Cys Ala Pro Glu Ser Pro Thr Leu 
195 200 205 



AGT GCC ATC CCA AGG ACA CTC TAG GTA TCT CAG CCA GAG CCC ACA AGA 729 
Ser Ala lie Pro Arg Thr Leu Tyr Val Ser Gin Pro Glu Pro Thr Arg 
210 215 220 225 

TCC CAA CCC CTG GAT CAA GAG CCA GGG CCC AGC CAA ACT CCA AGC ATC 777 
Ser Gin Pro Leu Asp Gin Glu Pro Gly Pro Ser Gin Thr Pro Ser He 
230 235 240 

CTT ACA TCG TTG GGT TCA ACC CCC ATT ATT GAA CAA AGT ACC AAG GGT 825 
Leu Thr Ser Leu Gly Ser Thr Pro He lie Glu Gin Ser Thr Lys Gly 
245 250 255 

GGC ATC TCT CTT CCA ATT GGT CTG ATT GTT GGA GTG ACA TCA CTG GGT 873 
Gly He Ser Leu Pro He Gly Leu He Val Gly Val Thr Ser Leu Gly 
260 265 270 

CTG CTG ATG TTA GGA CTG GTG AAC TGC ATC ATC CTG GTG CAG AGG AAA 921 
Leu Leu Met Leu Gly Leu Val Asn Cys He He Leu Val Gin Arg Lys 
275 280 285 

AAG AAG CCC TCC TGC CTA CAA AGA GAT GCC AAG GTG CCT CAT GTG CCT 969 
Lys Lys Pro Ser Cys Leu Gin Arg Asp Ala Lys Val Pro His Val Pro 
290 295 300 305 

GAT GAG AAA TCC CAG GAT GCA GTA GGC CTT GAG CAG CAG CAC CTG TTG 1017 
Asp Glu Lys Ser Gin Asp Ala Val Gly Leu Glu Gin Gin His Leu Leu 
310 315 320 

ACC ACA GCA CCC AGT TCC AGC AGC AGC TCC CTA GAG AGC TCA GCC AGC 1065 
Thr Thr Ala Pro Ser Ser Ser Ser Ser Ser Leu Glu Ser Ser Ala Ser 
325 330 335 

GCT GGG GAC CGA AGG GCG CCC CCT GGG GGC CAT CCC CAA GCA AGA GTC 1113 
Ala Gly Asp Arg Arg Ala Pro Pro Gly Gly His Pro Gin Ala Arg Val 
340 345 350 

ATG GCG GAG GCC CAA GGG TTT CAG GAG GCC CGT GCC AGC TCC AGG ATT 1161 
Met Ala Glu Ala Gin Gly Phe Gin Glu Ala Arg Ala Ser Ser Arg He 
355 360 365 

TCA GAT TCT TCC CAC GGA AGC CAC GGG ACC CAC GTC AAC GTC ACC TGC 1209 
Ser Asp Ser Ser His Gly Ser His Gly Thr His Val Asn Val Thr Cys 
370 375 380 385 

ATC GTG AAC GTC TGT AGC AGC TCT GAC CAC AGT TCT CAG TGC TCT TCC 1257 
He Val Asn Val Cys Ser Ser Ser Asp His Ser Ser Gin Cys Ser Sex 
390 395 400 

CAA GCC AGC GCC ACA GTG GGA GAC CCA GAT GCC AAG CCC TCA GCG TCC 1305 
Gin Ala Ser Ala Thr Val Gly Asp Pro Asp Ala Lys Pro Ser Ala Ser 
405 410 415 

CCA AAG GAT GAG CAG. GTC CCC TTC TCT CAG GAG GAG TGT CCG TCT CAG 1353 
Pro Lys Asp Glu Gin Val Pro Phe Ser Gin Glu Glu Cys Pro Ser Gin 
420 425 430 
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TCC CCG TGT GAG ACT ACA GAG ACA CTG CAG AGC CAT GAG AAG CCC TTG 1401 
Ser Pro Cys Glu Thr Thr Glu Thr Leu Gin Sar Hia Glu Lys Pro Lou 
435 440 445 

CCC CTT GGT GTG CCG GAT ATG GGC ATG AAG CCC AGC CAA GCT GGC TGG 1449 
Pro Leu Gly Val Pro A«p Met Gly Met Lya Pro Ser Gin Ala Gly Trp 
450 455 460 465 



TTT GAT CAG ATT GCA GTC AAA GTG GCC TGA CCCCTGACAG GGGTAACACC 1499 
Phe Asp GXn lie Ala Val Lys Val Ala 



470 




475 








CTGCAAAGGG ACCCCCGAGA 


CCCTGAACCC 


ATGGAACTTC 


AT6ACTTTTG 


CTGGAXCCAT 


1559 


TTCCCTTAGT GGCTTCCAGA 


GCCCCAGTTG 


CAGGTCAAGT 


GAGGGCTGAG 


ACAGCTAGAG 


1619 


TGGTCAAAAA CTGCCATGGT 


GTTTTATGGG 


GGCAGTCCCA 


GGAAGTTGTT 


GCTCTTCCAT 


1679 


GACCCCTCTG GATCTCCTGQ 


GCTCTTGCCT 


GATTCTTGCT 


TCTGAGAGGC 


CCCAGTATTT 


1739 


TTTCCTTCTA AGGAGCTAAC 


ATCCTCTTCC 


ATGAATAGCA 


CAGCTCTTCA 


GCCTGAATGC 


1799 


TGACACTGCA GGGCGGTTCC 


AGCAAGTAGG 


AGCAAGTGGT 


GGCCTGGTAG 


GGCACAGAGG 


1859 


CCCTTCAGGT TAGTGCTAAA 


CTCTTAGGAA 


GTACCCTCTC 


CAAGCCCACC 


GAAATTCTTT 


1919 


TGATGCAAGA ATCAGAGGCC 


CCATCAGGCA 


GAGTTGCTCT 


GTTATAGGAT 


GGTAGOGCTG 


1979 


TAACTCAGTG GTCCAGTGTG 


CTTTTAGCAT 


GCCCTGGGTT 


TGATCCTCAG 


CAACACATGC 


2039 


AAAACGTAAG TAGACAGCAG 


ACAGCAGACA 


GCACAGCCAG 


CCCCCTGTGT 


GGTTTGCAGC 


2099 


CTCTGCCTTT GACTTTTACT 


CTGGTGGGCA 


CACAGAGGGC 


TGGAGCTCCT 


CCTCCTGACC 


2159 


TTCTAATGAG CCCTTCCAAG 


GCCACGCCTT 


CCTTCAGGGA 


ATCTCAGGGA 


CTGTAGAGTT 


2219 


CCCAGGCCCC TGCAGCCACC 


TGTCTCTTCC 


TACCTCAGCC 


TGGAGCACTC 


CCTCTAACTC 


2279 


CCCAACGGCT TGGTACTGTA 


CTTGCTGTGA 


CCCCAACGTG 


CATTGTCCGG 


GTTAGGCACT 


2339 


GTGAGTTGGA ACAGCTCATG 


ACATCGGTTG 


AAAGGCCCAC 


CCGGAAACAG 


CTAAGCCAGC 


2399 


TCTTTTGCCA AAGGATTCAT 


GCCGGTTTTC 


TAATCAACCT 


GCTCCCTAGC 


ATTGCCTGGA 


2459 


AGGAAAGGGT TCAGGAGACT 


CCTCAAGAAG 


CAAGTTCAGT 


CTCAGGTGCT 


TGGATGCCAT 


2519 


GCTCACCGAT TCCACTGGAT 


ATGAACTTGG 


CAGAGGAGCC 


TAGTTGTTGC 


CATGGAGACT 


2579 


TAAAGAGCTC AGCACTCTGG 


AATCAAGATA 


CTGGACACTT 


GGGGCCGACT 


TGTTAAGGCT 


2639 


CTGCAGCATC A6ACTGTAGA 


GGGGAAGGAA 


CACGTCTGCC 


CCCTGGTGGC 


CCGTCCTGGG 


2699 


ATGACCTCGG GCCTCCTAGG 


CAACAAAAGA 


ATGAATTGGA 


AAGGATGTTC 


CTGGGTGTGG 


2759 


CCTAGCTCCT GTQCTTGTGT 


GGATCCCTAA 


AGGGTGTGCT 


AAGGAGCAAT 


TGCACTGTGT 


2819 


GCTGGACAGA ATTCCTGCTT 


ATAAATGCTT 


TTTGTTGTTG 


TXTTGTACAC 


TGAGCCCTGG 


2879 


CTGAGCCACC CCACCCCACC 


TCCCATCCCA 


CCTTTACACG 


CCACTCTTGC 


ATGAGAACCT 


2939 


GGCTGTCTCC CACTTGTAGC 


CTGTGGATGC 


TGAGGAAACA 


CCCAGCCAAG 


TAGACTCCAG 


2999 


GCTTGCCCCT ATCTCCTGCT 


ATGAGTCTGG 


CCTCCTCATT 


GTGTTGTGGG 


AAGGAGACGG 


3059 
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GTTCTGTCAT CTCGGAACGC CCACACCGTG GATGTGAACA ATGGCTGTAC TAGCTTAGAC 3119 

CAGCTTAGGG CTCTGCATAT CACAGGAGGG GGAGCAGGGA ACAATTTGAG TGCTGACCTA 3179 

TAACACAGTT CCTAAAGGAT CGGGCAGTCC AGAATCTCCT CCTTCAGTGT GTGTGTGTGT 3239 

GTGTGTGTGT GTGTGTGTGT GTGTGTGTGT CCATGTTTGC ATGTATGTGT GTGCCAGTGT 3299 

GTGGAGGCCC GAGGTTGGCT TTGGGTGTGT TTGATCACTC TCCAQTTACT GAGGCGGGCT 3359 

CTCATCTGTA CCCAGAGCTT GCACATTTTC TAGTCTAACT TGATTCAGOiG ATCTCTGTCT 3419 

GCCTATGGAG GTGCTCAGGT TACAGGCAGG CTGCCATACC TGCCCGACAT TTACATGAAT 3479 

ACTAGAGATC TGAATTCTGG TCCTCACACT TGTATACCTG CATTTTATCC ACTAAGACAT 3539 

CTCTCCAAGG GCTCCCCCTT CCTATTTAAT AAGTTAGTTT TGAACTGGCA AGATGGCTCA 3599 

GTGGGTAAGG CAGTTTGCGG ACAAACCTGA TGACCTGAGT TGGATCCCTG ACCATAAGGT 3659 

AGAAGAGACC TGATTCCTGC AAGTTGTCCT CTGACCACCA CCCCATACAT GCTTCTGCAT 3719 

ATGTGCACAC ATCACATTCT TGCACACACA CTCACATACC ATAAATGTAA TAAATTTTTT 3779 

TAAATAAATT GATTTTATCT TTTAAAAAAA AAAA 3813 

(2) INFORMATION FOR SEQ ID NO: 4: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 475 amino acids 
<B} TYPE: amino acid 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Mat Ala Pro Ala Ala Leu Trp Val Ala Leu Val Phe Glu Leu Gin Leu 
15 10 15 

Trp Ala Thr Gly Hia Thr Val Pro Ala '"^n Val Val Leu Thr Pro Tyr 
20 25 30 

Lya Pro Glu Pro Gly Tyr Glu Cys Gin He Ser Gin Glu Tyr Tyr Asp 
35 40 45 

Arg Lys Ala Gin Met Cys Cys Ala Lys Cys Pro Pro Gly Gin Tyr Val 
50 55 60 

Lys His Phe Cys Asn Lys Thr Ser Aap Thr Val Cys Ala Asp Cya Glu 
65 70 75 80 

Ala Ser Met Tyr Thr Gin Val Trp Aan Gin Pho Arg Thr Cya Leu Ser 
as 90 95 

Cya Ser Ser Ser Cya Thr Thr Asp Gin Val Glu He Arg Ala Cys Thr 
100 105 110 

Lys Gin Gin Asn Arg Val Cys Ala Cys Glu Ala Gly Arg Tyr Cya Ala 
115 120 125 
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Leu Lys Thr Hia Ser Gly S«r Cys Arg Gin Cy3 Met Arg Leu Ser Lys 
130 135 140 



Cys Gly Pro Gly Phe Gly Val Ala Ser Ser Arg Ala Pro Asn Gly Aan 
145 150 155 160 

Val Leu Cys Lys Ala Cya Ala Pro Gly Thr Phe Ser Aap Thr Thr Ser 
165 170 175 

Ser Thr Asp Val Cys Arg Pro His Arg He Cys Ser He Leu Ala He 
180 185 190 

Pro Gly Aan Ala Ser Thr Asp Ala Val Cys Ala Pro Glu Ser Pro Thr 
195 200 205 

Leu Ser Ala He Pro Arg Thr Leu Tyr Val Ser Gin Pro Glu Pro Thr 
210 215 220 

Arg Ser Gin Pro Leu Asp Gin Glu Pro Gly Pro Ser Gin Thr Pro Ser 
225 230 235 240 

He Leu Thr Ser Leu Gly Ser Thr Pro He He Glu Gin Ser Thr Lys 
245 250 255 

Gly Gly He Ser Leu Pro He Gly Leu He Val Gly Val Thr Ser Leu 
260 265 270 

Gly Leu Leu Met Leu Gly Leu Val Asn Cys He He Leu Val Gin Arg 
275 280 285 

Lys Lys Lys Pro Ser Cys Leu Gin Arg Asp Ala Lys Val Pro Hia val 
290 295 300 

Pro Asp Glu Lys Ser Gin Asp Ala Val Gly Leu Glu Gin Gin His Leu 
305 310 315 320 

Leu Thr Thr Ala Pro Ser Ser Ser Ser Ser Ser Leu Glu Ser Ser Ala 
325 330 335 

Ser Ala Gly Asp Arg Arg Ala Pro Pro Gly Gly His Pro Gin Ala Arg 
340 345 350 

Val Met Ala Glu Ala Gin Gly Phe Gin Glu Ala Arg Ala Ser Ser Arg 
355 360 365 

He Ser Asp Ser Ser His Gly Ser His Gly Thr His Val Asn Val Thr 
370 375 380 

Cys He Val Asn Val Cys Ser Ser Ser Aap His Ser Ser Gin Cys Ser 
385 390 395 400 

Ser Gin Ala Ser Ala Thr Val Gly Asp Pro Asp Ala Lys Pro Ser Ala 
405 410 415 

Ser Pro Lys Asp Glu Gin Val Pro Phe Ser Gin Glu Glu Cys Pro Ser 
420 425 430 

Gin Ser Pro Cys Glu Thr Thr Glu Thr Leu Gin Ser His Glu Lys Pro 
435 440 445 

Leu Pro Leu Gly Val Pro Asp Met Gly Met Lys Pro Ser Gin Ala Gly 
450 455 460 
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Trp Phe Asp Qln lie Ala Val Lys Val Ala 
465 470 475 
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linaA 

1 B1IMF-SA2I9 

) B«THF-BAi63 

^ 1 H«niF-IiA142 

^ 1 I MalMF-R 
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GCGAGGCAGGCAGCCTGGAGA6MGGCG 28 

CTGGGCTGCGAGGGCGCGAGGGCGCGXGGGCAGGGGGCAACCGGACCCCGCCCGCATCC 6 7 

ATG GCG CCC GTC GCC 6TC TGG GCC GCG CTG GCC GTC G6A CTG GAG 132 

Mat Ala Pro Val Ala Val Trp Ala Ala Isn Ala Val Gly Ii«u Glu -8 

CTC TGG GCT GCG GCG CAC GCC ^TG CCC GCC CAG GTG GCA TTT ACA 177 

L«u Trp Ala Ala Ala His Ala Pro jAla Gin Val Ala Phe Thr 8 

CCC TAC GCC CCG GAG CCC GGG AGC ACa'tGC CGG CTC AGA GAA TAC 222 

Pro Tyr Ala Pro Glu Pro Gly Sar Thr Cya Arg Lau Arg Glu Tyr 23 

TAT GAC CAG ACA GCT CAG ATG TOC TGC AGC AAA TOC TCQ CCG GGC 267 

Tyr Aap Gin Thr Ala Gin Mat Cya Cya Sar Lys Cya Sar Pro Gly 38 

CAA CAT GCA AAA GTC TTC T6T ACC AAG ACC TCG GAC ACC GTG TGT 312 

Gin His Ala Lys Val Pha Cys Thr Lys Thr Sar Asp Thr Val Cys 53 

GAC TCC TGT GAG GAC AGC ACA TAC ACC CAG CTC TGG AAC TGG GTT 357 

Asp Sar Cys Glu Asp Sar Thr Tyr Thr Gin Lau Trp Asn Trp Val 68 

CCC GAG TGC TTG AGC TGT GGC TCC CGC TGT AGC tCT GAC CAG GTG 402 

Pro Glu Cys Leu. Sar Cys Gly Sar Arg Cya Sar Sar Asp Gin Val 83 

GAA ACT CAA GCC TGC ACT CGG GAA CAG AAC CGC ATC TGC ACC TGC 447 

Glu Thr Gin Ala Pys Thr Arg Glu Gin Asn Arg Ila Cys Thr Cya 98 

AGG CCC GGC TGG TAC TGC GCG CTG AGC AAG CAG GAG GGG TGC CGG 492 

Arg Pro Gly Trp Tyr Cys Ala Lau Sar Lys Gin Glu Gly Cys Arg 113 

CTG TGC GCG CCG CTG CGC AAG TGC CGC CCG GGC TTC GGC GTG GCC 537 

Lau Cys Ala Pro Lau Arg Lys Cys Arg Pro Gly Pha Gly Val Ala 128 

AGA CCA GGA ACT GAA ACA TCA GAC GTG GTG TGC AAG CCC TGT GCC 582 

Arg Pro Gly Thr Glu Thr Sar Asp Val Val Cys Lys Pro Cya^Ala 143 

CCG GOG ACG TTC TCC AAC ACQ ACT TCA TCC ACQ GAT ATT TGC AGG 627 

Pro Gly Thr Pha Sar Asn Thr Thr Sar Sar Thr Asp Ila Cys Arg 158 

CCC CAC CAG ATC TGT AAC GTG GTG GCC ATC CCT GGG AAT GCA AGC 672 

Pro His Gin Ila Cys^Asn Val Val Ala Ila Pro Gly Asn Ala Sar 173 

Afg^GAT GCA GTC TGC ACG TCC ACQ TCC CCC ACC CGG AGT ATG GCC 717 

6sat,Asp Ala Val Cys Thr Sar Thr Sar Pro Thr Arg^Sar Mat Ala 188 

CCA GGG GCA GTA CAC TTA CCC CAO CCA GTG TCC ACA CGA TCC CAA 762 

Pro Gly Ala Val His Lau Pro Gin Pro Val Sar Thr Arg Sar Gin 203 

CAC ACQ CAG CCA ACT CCA GAA CCC AGC ACT GCT CCA AGC ACC TCC 807 

His Thr Gin Pro Thr Pro Glu Pro Sar Thr Ala Pro Sar Thr Sar 218 

TTC CTG CTC CCA ATG GGC CCC AGC CCC CCA GCT GAA GGG AGC ACT 852 

Pha Lau Lau Pro Mat Gly Pro Sar Pro Pro Ala Glu Gly Sar Thr 233 

GGC GAC TTC GCT CTT CCA GTT GGA CTG ATT GTG GGT GTG ACA GCC 897 

Gly Asp ?hA XlM Lftii Pro VAt Qly tftu Tift Val fllv Val Thr Ala 248 
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TTG 


GGT CTA CTA ATA ATA 

Glv L«« T^M Tl« Tl« 


GGA GTG 

Glv Val 


GTG 

val 


AAC 

.dAaa 


TGT 


GTC 


ATC 

TlA 


ATG 


ACC 
.lh£ 


942 
263 


CAG 


GTG AAA 


AAG 


AAG 


CCC 


TTG 


TGC 


CTG 

WAV 


CAG 

WAV 


AGA 


GAA 

vnn 


GCC 


AAG 


GTG 




G^*^ v*^ Lys 




Lva 


Pro 


Lau 


Cva 


Lau 


Gin 


Arg 


Glu 


Ala 


Lys 


Val 


278 


CCT 


CXC TTG 

>«AW AAV 


CCT 


GCC 


GAT 


AAG 


GCC 




uvX 


AWA 


UAv 


vvw 


Www 


vAv 




Pro 


His L#U 




Ala 

AAA 






AIa 


Arg 


Gly 


Thr 


Gin 


Gly 


Pro 


Glu 


293 




WiU CAW 


CTG 


CTG 


ATC 


ACA 


GCG 


CCG 


AGC 


TCC 


AGC 


AGC 


AGC 


TCC 


1077 


Gin 


Gin His 


Lau 


Lau 




xnr 


Axa 


Pro 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


308 


CTG 

WAV 


GAG AGC 


TCG 


GCC 


AGT 


GCG 


TTG 


GAC 


AGA 


AGG 


GCG 


CCC 


ACT 


CGG 


1122 


L«tt 


Olu S«r 


sar 


& 1 • 
AX a 


sar 


Axa 


£iau 


Aap 


Arg 


Arg 


Ala 


Pro 


Thr 


Arg 


323 




nxG CCA 


CAG 


GCA 


CCA 


GGC 


GTG 


GAG 


GCC 


AGT 


GGG 


GCC 


GGG 


GAG 


1167 


Asn 


Gin ?ro 


V9xn 


Axa 


rro 


uxy 


vax 


Glu 


Ala 


Ser 


Gly 


Ala 


Gly 


Glu 


338 




CGG GCC 


AGC 


ACC 


GGG 


AGC 


TCA 


GAT 


TCT 


TCC 


CCT 


GGT 


GGC 


CAT 


1212 


Ala 


Axg Ala 


sar 




wAy 


sar 


sar 


Aap 


Ser 


Ser 


Pro 


Gly 


Gly 


Hia 


353 


GGG 


ACC CAG 


GTC 


AAT 


GTC 


ACC 


TGC 


ATC 


GTG 


AAC 


GTC 


TGT 


AGC 


AGC 


1257 


Gly 


Thr Gin 






VAX 




Cvs 
wya 


lie 


Val 


Aan 


Val 


Cya 


Ser 


Ser 


368 


TCT 


GAC CAC 


AGC 


TCA 


CAG 


TGC 


TCC 


TCC 


CAA 




AvC 




AwA 


ATv 


1 JU^ 


S«r 


Aap Hia 


Ser 


Sar 


Gin 


Cya 


Sar 


Ser 


Gin 


Ala 


Ser 


Ser 


Thr 


Met 


383 


GGA 


GAC ACA 


GAT 


TCC 


AGC 


CCC 


TCG 


GAG 


TCC 


CCG 


AAG 


GAC 


GAG 


CAG 


1347 


Gly 


Asp Thr 


A3P 


Sar 


Sar 


Pro 


Sar 


Glu 


Ser 


Pro 


Lya 


Aap 


Glu 


Gin 


398 


6TC 


CCC TTC 


TCC 


AAG 


GAG 


GAA 


TGT 


GCC 


TTT 


CGG 


TCA 


CAG 


CTG 


GAG 


1392 


val 


Pro Phe 


Sar 


Lys 


Glu 


Glu 


Cys 


Ala 


Phe 


Arg 


Ser 


Gin 


Leu 


Glu 


413 


ACG 


CCA GAG 


ACC 


CTG 


CTG 


GGG 


AGC 


ACC 


GAA 


GAG 


AAG 


CCC 


CTG 


CCC 


1437 


Thr 


Pro Glu 


Thr 


Lau 


Lau 


Gly 


Sar 


Thr 


Glu 


Glu 


Lya 


Pro 


Leu 


Pro 


428 


CTT 


GGA GTG 


CCT 


GAT 


GCT 


GGG 


ATG 


AAG 


CCC 


AGT 










1470 


Leu 


Gly Val 


Pro 


Aap 


Ala 


Gly 


Mat 


Lya 


Pro 


Ser 










439 



TAACCAGGCCGGTGTGGGCTGTGTCOTAGCCAAGGTGGGCTGAGCCCTGGCAQGATGAC 



CCTGCGAAGGGGCCCTGGTCCTTCCAGGCCCCCACCACTAGGACTCTGAGGCTCTTTCT 
GGGCCAAGTTCCTCTAGTGCCCTCCACAGCCGCAGCCTCCCTCTQACCTGCAG, . . 
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CGCAGCTGAG6CACTAGAGCTCC 


23 


AGGCACAAGGOCGGGAGCCACCGCTGCCCCT 


ATG 
Met 


GCG 
Ala 


CCC 
Pro 


GCC 
Ala 


GCC 
Ala 


CTC 
Leu 


TGG 
Trp 


75 
-16 


V«l 


Ala 


CTS 

Leu 


GTC 
Val 


TTC 
Phe 


GAA 

Glu 


CTG 
Leu 


CAG 
Gin 


CTG 
Leu 


TGG 
Trp 


GCC 
Ala 


ACC 
Thr 


GGG 
Gly 


CAC 
Hia 


ACA 

Thr 


120 
-1 


vyiTU 


Pro 


sec 

www 

Alt 


CAG 

V«AW 

Gin 


GTT 
Val 


GTC 

Val 


TTG 
Leu 


ACA 
Thr 


CCC 
Pro 


TAC 
Tyr 


AAA 

Lys 


CCG 
Pro 


GAA 

Glu 


CCT 
Pro 


GGG 
Gly 


165 
15 


TaC 
Tyr 


Qlu 


TTjC 

Cys 


CAw 

Gin 


ft<r^ 
ATw 

He 


Ser 


WAW 

Gin 


GAA 

Glu 


TAC 
Tyr 


TAT 

Tyr 


GAC 
Wn^ 

A8P 


AGG 
AWW 

Arg 


AAG 

AAw 

Lys 


GCT 

WW A 

Ala 


CAG 
Gin 


210 
30 


ATQ 
Mt 


Cys 


TOT 
Cys 


vjwT 

Ala 


AAA 
AAw 

Lys 


TGT 
Cys 


Wwl 

Pro 


CCT 
Pro 


GGC 
Gly 


CAA 

Gin 


TAT 

Tyr 


GTG 
Val 


AAA 

AAA 

Lys 


CAT 

WA* 

His 


TTC 
Phe 


255 
45 


TGC 

Cys 


AAw 
Asn 


ft ft/2 

Lys 


AWC 

Thr 


Tww 

Ser 


UAW 

Asp 


ACC 

A^^ 

Thr 


GTG 
VJl w 

val 


TGT 
Cya 


GCG 
www 

Ala 


GAC 

WAW 

Asp 


TGT 
Cys 


GAG 

WAW 

Glu 


GCA 
Ala 


AGC 
Ser 


300 
60 


Met 


Tyr 


Thr 


CAG 

WAU 

Gin 


GTC 

WA w 

Val 


TGG 
Trp 


AAC 
Asn 


CAG 
Gin 


TTT 
Phe 


CGT 

WW 4 

Arg 


ACA 
Thr 


TGT 

Cys 


TTG 
Leu 


AGC 
Ser 


TGC 
Cys 


345 

75 


S«r 


TCT 
Ser 


Ser 


TGT 

Cys 


ACC 

AWW 

Thr 


ACT 

AW A 

Thr 


GAC 
Asp 


CAG 
Gin 


GTG 
Val 


GAG 
Glu 


ATC 

lie 


CGC 
Arg 


GCC 
Ala 


TGC 
Cys 


ACT 
Thr 


390 
90 


AAA 

Lys 


Gin 


can 

WA\J 

Gin 


AAC 

AAW 

Asn 


CGA 

wWA 

Arg 


GTG 

WAV 

Val 


TGT 
Cys 


GCT 
Ala 


TGC 
Cys 


GAA 
Glu 


GCT 
Ala 


GGC 
Gly 


AGG 
Arg 


TAC 
Tyr 


ToC 
Cys 


435 
105 


Ala 


TTG 

Leu 


AAA 
Ly« 


ACC 
Thr 


CAT 
His 


TCT 
Ser 


GGC 
Gly 


AGC 
Ser 


TGT 
Cys 


CGA 
Arg 


CAG 
Gin 


TGC 
Cys 


ATG 
Met 


AGG 
Arg 


CTG 
Leu 


480 
120 


Ser 


aA6 
Lys 


TGC 

Cys 


GGC 
Gly 


CCT 
Pro 


GGC 
Gly 


TTC 
Phe 


GGA 
Gly 


GTG 
Val 


GCC 
Ala 


AGT 
Ser 


TCA 
Ser 


AGA 
Arg 


GCC 
Ala 


CCA 
Pro 


525 
135 


AAT 

Asn 


GGA 
Gly 


AAT 
Asn 


GTG 
Val 


CTA 
Leu 


TGC 
Cys 


A^G 
Lys 


GCC 
Ala 


TGT 
Cys 


GCC 
Ala 


CCA 
Pro 


GGG 
Gly 


ACG 
Thr 


TTC 
Phe 


TCT 
Ser 


570 
150 


GAC 
Aap 


ACC 
Thr 


ACA 

Thr 


TCA 
Ser 


TCC 
Ser 


ACT 
Thr 


GAT 
Asp 


GTG 

Val 


TGC 
Cys 


AGG 
Arg 


CCC 
Pro 


CAC 
His 


CGC 
Arg 


ATC 

tie 


TGT 
Cys 


615 
165 


AGC 
S«r 


ATC 
lie 


CTG 
Leu 


GCT 
Ala 


ATT 
He 


CCC 
Pro 


GGA 
Gly 


AAT 
Asn 


GCA 
Ala 


AGC 
Ser 


ACA 
Thr 


GiLT 
Asp 


GCA 
Ala 


GTC 
Val 


TGT 
Cys 


660 
180 


GC6 
Ala 


ccc 

Pro 


GAG 
Glu 


TCC 
Ser 


CCA 
Pro 


ACT 
Thr 


CTA 
Leu 


AwT 

Ser 


GCC 

Al» 


ATC 
lie 


CCA 
Pro 


AGG 
Arg 


ACA 

Thr 


CTC 
Leu 


TAC 
Tyr 


705 
195 


GTA 
Val 


TCT 
Ser 


CAG 
Gin 


CCA 
Pro 


GAG 

Glu 


CCC 
Pro 


ACA 
Thr 


AGA 
Arg 


TCC 
Ser 


CAA 

Gin 


CCC 

Pro 


CTG 
Leu 


GAT 
Asp 


CAA 

Gin 


GAG 
Glu 


750 
210 


CXA 
Pro 


GOG 
Gly 


CCC 
Pro 


AGC 
Ser 


CAA 

Gin 


ACT 
Thr 


CCA 
Pro 


AGC 
Ser 


ATC 
lie 


CTT 
Leu 


ACA 

Thr 


TCG 
Ser 


TTG 
Leu 


GGT 
Gly 


TCA 
Ser 


795 
225 


ACC 
Thr 

ATT 
lift. 


ccc 

Pro 
GGT 


ATT 
lie 

CTG 


ATT 
He 

ATT 


GAA 

Glu 

GTT 


CAA 

Gin 

GGA 


AGT 
Ser 

GTG 
.yaL 


ACC 
Thr 


AAG 


GGT 
Glv 


GGC 


ATC 

tlA 


TCT 


CTT 


CCA 
Pro 


840 
240 


ACA 


TCA CTG 


GGT 

Gly 


CTG CTG ATQ TTA 


885 
255 
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GGA CTG 


GTG 


AAC TGC 


ATC ATC 


CTG 


GTG 
JUL 


CAG 

J31a 


AGG 
Arg 


AAA 

Lya 


AAG 
Lya 


AAG 
Lys 


CCC 
Pro 


930 
270 


TCC TOC 
Sar Cya 


CTA 
Lau 


CXA 

Gin 


AGA 
Arg 


GAT 
Aap 


GCC 
Ala 


AAG 
Lya 


GTG 
Val 


CCT 
Pro 


CAT 
Hia 


GTG 
Val 


CCT 
Pro 


GAT 
Aap 


GAG 

Glu 


975 
283 


XXX TCC 
Lys Sax 


CAG 
Gin 


GAT 
Aap 


GCA 
Ala 


GTA 
Val 


GGC 
Gly 


CTT 
Lau 


GAG 
Glu 


CAG 
Gin 


CAG 
Gin 


CAC 
Hia 


CTG 
Lau 


TTG 
Lau 


ACC 
Thr 


1020 
300 


XCX GCX 
Thr Ala 


CCC 
Pro 


AGT 
Sar 


TCC 
Sar 


AGC 
Sar 


AGC 
Sar 


AGC 
Sar 


TCC 
Sar 


CTA 
Lau 


GAG 
Glu 


AGC 
Sar 


TCA 
Sar 


GCC 
Ala 


AGC 
Sar 


1065 
315 


GOT GGG 
Ala 61y 


GAC 
Aap 


CGA 
Arg 


AGG 
Arg 


GCG 
Ala 


CCC 
Pro 


CCT 
Pro 


GGG 
Gly 


GGC 
Gly 


CAT 
Hia 


CCC 
Pro 


CAA 

Gin 


GCA 
Ala 


AGA 
Arg 


1110 
330 


OTC ATG 
Val Mat 


GCG 
Ala 


GAG 
Glu 


GCC 
Ala 


CAA 

Gin 


GGG 
Gly 


TTT 
Pha 


CAG 

Gin 


GAG 
Glu 


GCC 
Ala 


CGT 
Arg 


GCC 
Ala 


AGC 
Sar 


TCC 
Sar 


1155 
345 


AGG ATT 
Arg Zla 


TCA 
Sar 


GAT 
Aap 


TCT 
Sar 


TCC 
Sar 


CAC 
Hia 


GGA 
Gly 


AGC 
Sar 


CAC 
Hia 


GGG 
Gly 


ACC 
Thr 


CAC 
Hia 


GTC 
Val 


AAC 

Aan 


1200 
3S0 


GTC ACC 
Val Thr 


TGC 
Cya 


ATC 
Xla 


GTG 
Val 


AAC 
Aan 


GTC 
Val 


TGT 
Cya 


AGC 
Sar 


AGC 
Sar 


TCT 
Sar 


GAC 
Aap 


CAC 
Hia 


AGT 
Sar 


TCT 
Sar 


1245 
375 


CAG TGC 
Gin Cya 


TCT 
Sar 


TCC 
Sar 


CAA 

Gin 


GCC 
Ala 


AGC 
Sar 


GCC 
Ala 


ACA 

Thr 


GTG 
Val 


GGA 
Gly 


GAC 
Aap 


CCA 
Pro 


GAT 
Aap 


GCC 
Ala 


1290 
390 


AAG CCC 
Lya Pro 


TCA 
Sar 


GCG 
Ala 


TCC 
Sar 


CCA 
Pro 


AAG 
Lya 


GAT 
Aap 


GAG 
Glu 


CAG 
Gin 


GTC 
Val 


CCC 
Pro 


TTC 
Pha 


TCT 
Sar 


CAG 
Gin 


1335 
405 


GAG GAG 
Glu Glu 


TGT 
Cya 


CCG 
Pro 


TCT 
Sar 


CAG 

Gin 


TCC 

Sar 


CCG 
Pro 


TGT 
Cya 


GAG 
Glu 


ACT 
Thr 


ACA 
Thr 


GAG 
Glu 


ACA 

Thr 


CTG 
Lau 


1380 
420 


CAG AGC 
GlQ Sar 


CAT 
Hia 


GAG 
Glu 


AAG 
Lya 


CCC 
Pro 


TTG 
Lau 


CCC 
Pro 


CTT 
Lau 


GGT 
Gly 


GTG 
Val 


CCG 
Pro 


GAT 
Aap 


ATG 
Mat 


GGC 
Gly 


1425 
435 


ATG AAG 
Mat Lya 


CCC 
Pro 


AGC 
Sar 


CAA 

Gin 


GCT 
Ala 


GGC 
Gly 


TGG 
Trp 


TTT 
Pha 


GAT 
Aap 


CAG 
Gin 


ATT 
lla 


GCA 
Ala 


GTC 
Val 


AAA 

Lya 


1470 
450 


GTG GCC 
Val Ala 




























1476 
452 



TGACCCCTQACAGGGGTAACACCCTGCAAAGGGACCCCCGAGACCCTGAACCCATGQAAC 1536 

TTCATGACTTTTGCTGGATCCATTTCCCTTAGTGGCTTCCAGAGCCCCAGTTGCAGGTCA 1596 

AGTGAGGGCTGAGACAGCTAGAGTGGTCAAAAACTGCCATGGTGTTTTATGGGGGCAGTC 1656 

CCAOGAAOTTGTTGCTCTTCCATGACCCCTCTGGATCTCCTGGGCTCTTGCCTGATTCTT 1716 

GCTTCTGAOAGGCCCCAGTATTTTTTCCTTCTAAGGAGCTAACATCCTCTTCCATGAATA 1776 

GCACAGCTCTTCAGCCTGAATGCTGACACTGCAGGGCGGTTCCAGCAAGtAGGAGCAAGT 1836 

GGTGGCCTGGTAOGGCACAGAGGCCCTTCAGGTTAGTGCTAAACTCTTAGGAAGTACCCT 1896 

CTCCAAGCCCAOCGAAATTCTTTTGATGCAAGAATCAQAGGCCCCATCAGGCAGAGTTGC 1936 

TCTGTTATAGQATGGTAGGGCTGTAACTCAgTGGTCCAgTGTGCTTTTAGCATGCCCTGG 2016 

GTTTGATCCTOUSCAACACATGCAAAACGTAAGTAGACAQCAGACAGCAGACAGCACAGC 2076 

CAGOCCCCT U TGT QG TTTQCAGCCTCTGCCTTTGACTTTTACTCTGQTOGqCACACAGAQ 2136 

GGCTGGAGCTCCTCCTCCTGACCTTCTAATGAGCCCTTCCAAGGCCACGCCTTCCTTCAG 2196 

GGAATCTCAOQGACTOTAGAGTTCCCAGGCCCCTGCAGCCACCTGTCTCTTCCTACCTCA 2256 

GCCTGGAGCAcTCCCTCTAACTCCCCAACGgCTTGGTACTGTACTTGCTGTGACCCCAAC 2316 

GTGCATTGTCCOGOTTAGQCACTGTGAGTTGGAACAGCTcATGACATCGGTTGAAAGGCC 2 37 6 

CACCCOGAAACAGCTAAGCCAGCTCTTTTGCCAAAGGATTCATGCCGGTTTTCTAATCAa 2436 

CCTGCTCCCTAGCATTGCCTGGAAGGAAAGGGTTCAGGAGACTCCTCAAGAAGCAAGTTC 2496 

AGTCTCAGGTGCTTGGATGCCATGCTCACCGATTCCACTGGATATGAACTTGGCAGAGGA 2556 
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GCCTAQTTGTTGCCXTGGAGXCTTAAAGAGCTC^GCACTCTGGAATCAAGATACTGG^ 2616 

CTTGGGGCCGACTTGTTAAGGCTCTGCAGaiTCAGXCTGTAGAGGGGXAGGAACACGTCT 2676 

GCCCCCTGGTa«CCGTCCTGG«tGACCTCG<Wa:t CCTAGGCAACAAAAGAATGAATT 2736 

G(aUUlGGATGTTCCTGGGT0T00CCtAGCTCCTGTGCTTGTGTt3» 2796 

GCTAAGGAGCAATTGCACTGTGTGCTGGACAGXATTCCTGCTTATAAATGCTTTTTCT^ 2856 

TTGTTTTCTACACTGAOCCCTGGCTGJiGCCACCX 2916 

ACGCXJU:TCtTGCAT<aM»ACCTGGCTGTCTCCCACTTGTAGCXTGT^ 2976 

ACACCCAOCCAAOTAGACTCCXGGCTTgCCCCTATCTOCTGcTaTGAGTcTggOCTCCTC 3036 

AtXgTOTTGTGGQUigGAGACGGGtTCTGTCATCTCGGAAcgCCCACAC^^ 30 96 

JUyUTOGCTOTACTAGCTTAQWX»^TTAGGGCTCTGCAIATCA 3156 

GGAACJUiTTTaACWX»SACCTATAACACAgTTCCTAAAGGATCOGGCAGT(^^ 3216 

CCTCCTTCAGTGTGTGTGTGTGTGTGTGTGTGTGTGTQTGTGTGTGTGTGTGTCCATGTT 3276 

TGCATGTATGTGT0TG0CAGTGTGTGGAGGCCC6WWTTGGCTTTGGGTGTGTTTGAT» 3336 

CTCTCCAGTTACTGAGGCGGGCTCTCATCTGTACCCAGAGCTTGCACAXTTTCTAGTC^ 3396 

ACTTGATTCAGGGATCTCTGTCTGCCTATGGAGgTGCTCAGGmCAGGC^^ 3456 

ACCTGCXCGACATTTAaiTGAATACtAGAQATCTGAATTCTGQTCCTC^ 3516 

CTGCATTTTATCCACTAAQACATCTCTCCAAGGGCTCCCCCTTCCTATTTAATAAGTTAG 3576 

TTTTGAACTGGCAAGATGGCTCAGTGGGTAAGGaGTTTGCGGAa^CCTQAr^ 3636 

AGTtGGATCCCTGACOlTAAGGTAGAAGAGACCTGATTCCTGCiUlGTTGTCCTCTGACCA 3696 

CCACCCCATACATGCTTCTGCATATGTGCACACATCACATTCTTGCaCACACACTCACAT 3756 

ACCATAAATGTAATAAATTTTTTTAAATAAATTGATTTTATCTTTTAAAAAAAAAAA 3813 
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